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Introduction 



0.1 About this book 

This book is a one semester course in basic analysis. It started its life as my lecture notes for 
teaching Math 444 at the University of Illinois at Urbana-Champaign (UIUC) in Fall semester 2009. 
Later I added the metric space chapter to teach Math 521 at University of Wisconsin-Madison 
(UW). A prerequisite for the course should be a basic proof course, for example using [H], [F], or 
the unfortunately rather pricey [DW]. 

It should be possible to use the book for both a basic course for students who do not necessarily 
wish to go to graduate school (such as UIUC 444), but also as a more advanced one-semester course 
that also covers topics such as metric spaces (such as UW 521). Here are my suggestions for what 
to cover in a semester course. For a slower course such as UIUC 444: 

§0.3, §1.1-§1.4, §2.1-§2.5, §3.1-§3.4, §4.1-§4.2, §5.1-§5.3, §6.1-§6.3 

For a more rigorous course covering metric spaces that runs quite a bit faster (such as UW 521): 

§0.3, §1.1-§1.4, §2.1-§2.5, §3.1-§3.4, §4.1-§4.2, §5.1-§5.3, §6.1-§6.2, §7.1-§7.6 

It should also be possible to run a faster course without metric spaces covering all sections of 
chapters through 6. The approximate number of lectures given in the section notes through chapter 
6 are a very rough estimate and were designed for the slower course. 

The book normally used for the class at UIUC is Bartle and Sherbert, Introduction to Real 
Analysis third edition [BS]. The structure of the beginning of the book somewhat follows the 
standard syllabus of UIUC Math 444 and therefore has some similarities with [BS]. A major 
difference is that we define the Riemann integral using Darboux sums and not tagged partitions. 
The Darboux approach is far more appropriate for a course of this level. 

Our approach allows us to fit within a semester and still spend some extra time on the interchange 
of limits and end with Picard's theorem on the existence and uniqueness of solutions of ordinary 
differential equations. This theorem is a wonderful example that uses many results proved in the 
book. For more advanced students, material may be covered faster so that we arrive at metric spaces 
and prove Picard's theorem using the fixed point theorem as is usual. 

Other excellent books exist. My favorite is Rudin's excellent Principles of Mathematical 
Analysis [R2] or as it is commonly and lovingly called baby Rudin (to distinguish it from his other 
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great analysis textbook). I have taken a lot of inspiration and ideas from Rudin. However, Rudin 
is a bit more advanced and ambitious than this present course. For those that wish to continue 
mathematics, Rudin is a fine investment. An inexpensive and somewhat simpler alternative to Rudin 
is Rosenlicht's Introduction to Analysis [Rl]. There is also the freely downloadable Introduction to 
Real Analysis by William Trench [T]. 

A note about the style of some of the proofs: Many proofs traditionally done by contradiction, 
I prefer to do by a direct proof or by contrapositive. While the book does include proofs by 
contradiction, I only do so when the contrapositive statement seemed too awkward, or when 
contradiction follows rather quickly. In my opinion, contradiction is more likely to get beginning 
students into trouble, as we are talking about objects that do not exist. 

I try to avoid unnecessary formalism where it is unhelpful. Furthermore, the proofs and the 
language get slightly less formal as we progress through the book, as more and more details are left 
out to avoid clutter. 

As a general rule, I use := instead of = to define an object rather than to simply show equality. 
I use this symbol rather more liberally than is usual for emphasis. I use it even when the context is 
"local," that is, I may simply define a function f{x) :=x^ for a single exercise or example. 

Finally, I would like to acknowledge Jana Maffkova, Glen Pugh, Paul Vojta, Frank Beatrous, 
and Sonmez §ahutoglu for teaching with the book and giving me useful feedback. Frank Beatrous 
wrote the University of Pittsburgh version extensions, which served as inspiration for many of the 
recent additions. I would also like to thank Dan Stoneham, Jeremy Sutter, Eliya Gwetta, Daniel 
Alarcon, Steve Hoerning, Yi Zhang, Nicole Caviris, an anonymous reader, and in general all the 
students in my classes for suggestions and finding errors and typos. 
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0.2 About analysis 

Analysis is the branch of mathematics that deals with inequalities and limits. The present course 
deals with the most basic concepts in analysis. The goal of the course is to acquaint the reader with 
rigorous proofs in analysis and also to set a firm foundation for calculus of one variable. 

Calculus has prepared you, the student, for using mathematics without telling you why what 
you have learned is true. To use, or teach, mathematics effectively, you cannot simply know what is 
true, you must know why it is true. This course shows you why calculus is true. It is here to give 
you a good understanding of the concept of a limit, the derivative, and the integral. 

Let us use an analogy. An auto mechanic that has learned to change the oil, fix broken 
headlights, and charge the battery, will only be able to do those simple tasks. He will be unable to 
work independently to diagnose and fix problems. A high school teacher that does not understand 
the definition of the Riemann integral or the derivative may not be able to properly answer all the 
student's questions. To this day I remember several nonsensical statements I heard from my calculus 
teacher in high school, who simply did not understand the concept of the hmit, though he could "do" 
all problems in calculus. 

We start with a discussion of the real number system, most importantly its completeness property, 
which is the basis for all that comes after. We then discuss the simplest form of a limit, the limit of 
a sequence. Afterwards, we study functions of one variable, continuity, and the derivative. Next, we 
define the Riemann integral and prove the fundamental theorem of calculus. We discuss sequences 
of functions and the interchange of limits. Finally, we give an introduction to metric spaces. 

Let us give the most important difference between analysis and algebra. In algebra, we prove 
equalities directly; we prove that an object, a number perhaps, is equal to another object. In analysis, 
we usually prove inequalities. To illustrate the point, consider the following statement. 

Let xbe a real number. IfO<x<eis true for all real numbers £ > 0, then x = 0. 

This statement is the general idea of what we do in analysis. If we wish to show that = 0, we 
show that < jc < e for all positive e. 

The term real analysis is a little bit of a misnomer. I prefer to use simply analysis. The other 
type of analysis, complex analysis, really builds up on the present material, rather than being distinct. 
Furthermore, a more advanced course on real analysis would talk about complex numbers often. I 
suspect the nomenclature is historical baggage. 

Let us get on with the show. . . 



8 



INTRODUCTION 



0.3 Basic set theory 

Note: 1-3 lectures (some material can be skipped or covered lightly) 

Before we start talking about analysis we need to fix some language. Modern* analysis uses the 
language of sets, and therefore that is where we start. We talk about sets in a rather informal way, 
using the so-called "naive set theory." Do not worry, that is what majority of mathematicians use, 
and it is hard to get into trouble. 

We assume that the reader has seen basic set theory and has had a course in basic proof writing. 
This section should be thought of as a refresher. 

0.3.1 Sets 

Definition 0.3.1. A set is a collection of objects called elements or members. A set with no objects 
is called the empty set and is denoted by (or sometimes by {}). 

Think of a set as a club with a certain membership. For example, the students who play chess 
are members of the chess club. However, do not take the analogy too far. A set is only defined by 
the members that form the set; two sets that have the same members are the same set. 

Most of the time we will consider sets of numbers. For example, the set 

5:= {0,1,2} 

is the set containing the three elements 0, 1, and 2. We write 

les 

to denote that the number 1 belongs to the set S. That is, 1 is a member of S. Similarly we write 

to denote that the number 7 is not in S. That is, 7 is not a member of S. The elements of all sets 
under consideration come from some set we call the universe. For simplicity, we often consider the 
universe to be the set that contains only the elements we are interested in. The universe is generally 
understood from context and is not explicitly mentioned. In this course, our universe will most 
often be the set of real numbers. 

While the elements of a set are often numbers, other object, such as other sets, can be elements 
of a set. 

A set may contain some of the same elements as another set. For example, 

r:={0,2} 

contains the numbers and 2. In this case all elements of T also belong to S. We write T C S. More 
formally we have the following definition. 



*The term "modem" refers to late 19th century up to the present. 
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Definition 0.3.2. 

(i) A set A is a subset of a set 5 if x G A implies that x e B, and we write A C B. That is, all 
members of A are also members of B. 

(ii) Two sets A and B are equal if A C 5 and B C A. We write A = B. That is, A and B contain 
exactly the same elements. If it is not true that A and B are equal, then we write A^B. 

(iii) A set A is a proper subset of 5 if A C 5 and A^B. We write AC.B. 

When A = 5, we consider A and B to just be two names for the same exact set. For example, for 
S and T defined above we have T C S, but T ^ S. So T is a proper subset of S. At this juncture, we 
also mention the set building notation, 

{xeA:P{x)}. 

This notation refers to a subset of the set A containing all elements of A that satisfy the property 
P{x). The notation is sometimes abbreviated (A is not mentioned) when understood from context. 
Furthermore, x e A is sometimes replaced with a formula to make the notation easier to read. 

Example 0.3.3: The following are sets including the standard notations. 

(i) The set of natural numbers, N := {1,2, 3, . . .}. 

(ii) The set of integers, Z := {0, — 1, 1, —2,2, . . .}. 

(iii) The set of rational numbers, Q := -.m^neZ and n ^ 0}. 

(iv) The set of even natural numbers, {2m : m e N}. 

(v) The set of real numbers, R. 
Note that N C Z c Q C R. 

There are many operations we want to do with sets. 

Definition 0.3.4. 

(i) A union of two sets A and B is defined as 

AUB:={x:xeAorxeB}. 

(ii) An intersection of two sets A and B is defined as 

An5 := {jc : jc e A andjc e B}. 

(iii) A complement ofB relative to A (or set-theoretic difference of A and B) is defined as 

A\B:^{x:xeA andx^B}. 
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(iv) We say complement of B and write B'^ if A is understood from context. The set A is either the 
entire universe or is the obvious set containing B. 

(v) We say that sets A and B are disjoint if A fl 5 = 0. 

The notation B'^ may be a little vague at this point. If the set 5 is a subset of the real numbers M, 
then B^ means M \ 5. If 5 is naturally a subset of the natural numbers, then B^ is N \ 5. If ambiguity 
would ever arise, we will use the set difference notation A\B. 










A 




B 









AUB 











B 








A\B B' 
Figure 1: Venn diagrams of set operations. 



We illustrate the operations on the Venn diagrams in Figure 1 . Let us now establish one of most 
basic theorems about sets and logic. 

Theorem 0.3.5 (DeMorgan). Let A, B,C be sets. Then 

(BUCf = B^nc'', 
{Bncf = b''[jC, 

or, more generally, 

A\(5UC) = (A\5)n(A\C), 
A\{BnC) = (A\5)U(A\C). 
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Proof. The first statement is proved by tlie second statement if we assume that set A is our "universe." 

Let us prove A \ (5 U C) = (A \ 5) fl (A \ C) . Remember the definition of equality of sets. First, 
we must show that if x G A \ (5 U C) , then x e {A\B)n{A\C). Second, we must also show that if 
xe (A\5)n(A\C),thenxeA\(5UC). 

So let us assume that x e A \ (B U C). Then x is in A, but not in B nor C. Hence x is in A and not 
in 5, that is, X gA\5. Similarly x G A \ C. Thus x G (A\5)n(A\C). 

On the other hand suppose that x G (A \ 5) fl (A \ C) . In particular x G {A\B) and so x G A and 
x^B. Also as X G (A \ C), then x^C. Hence x G A \ (5 UC). 

The proof of the other equality is left as an exercise. □ 

We will also need to intersect or union several sets at once. If there are only finitely many, then 
we simply apply the union or intersection operation several times. However, suppose that we have 
an infinite collection of sets (a set of sets) {Ai,A2,A3, . . .}. We define 

oo 

1^ A„ := {x : X G A„ for some n G N}, 

n=l 

OO 

Pi A„ := {x : X G A„ for all n G N}. 

n=l 

We can also have sets indexed by two integers. For example, we can have the set of sets 
{^1,1,^1,2,^2,1,^1,3,^2,2,^3,1, . . .}. Then we write 



oo oo 



n=\m=\ n=\ \m=l 

And similarly with intersections. 

It is not hard to see that we can take the unions in any order. However, switching unions and 
intersections is not generally permitted without proof. For example: 



oo oo 



U Pi {A: G N : mifc < n} = IJ = 0. 

n=\m=\ n=l 



However, 



oo oo 



f] \J{keN:mk<n}= f]N = N. 

m=\n=\ m=l 



0.3.2 Induction 



A common method of proof is the principle of induction. We start with the set of natural numbers 
N = {1,2,3,...}. The natural ordering on N (that is, 1<2<3<4<---) has a wonderful property. 
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The natural numbers N ordered in the natural way possess the well ordering property. We take this 
property as an axiom. 

Well ordering property of N. Every nonempty subset ofN has a least (smallest) element. 

By 5 C N having a least element, we mean that there exist anxE S, such that for every y E S, 

we have x <y. 

The principle of induction is the following theorem, which is equivalent to the well ordering 
property of the natural numbers. 

Theorem 0.3.6 (Principle of induction). Let P{n) be a statement depending on a natural number n. 
Suppose that 

(i) (basis statement) P(l) is true, 

(ii) (induction step) ifP{n) is true, then P{n+ 1) is true. 
Then P{n) is true for all n&N. 

Proof. Suppose that S is the set of natural numbers m for which P{m) is not true. Suppose that S is 
nonempty. Then S has a least element by the well ordering property. Let us call m the least element 
of S. We know that 1 ^ 5 by assumption. Therefore m > 1 and m — 1 is a natural number as well. 
Since m was the least element of S, we know that P{m — 1) is true. But by the induction step we see 
that P{m 1 + 1) = P{m) is true, contradicting the statement that me S. Therefore S is empty and 
P{n) is true for all n G N. □ 

Sometimes it is convenient to start at a different number than 1 , but all that changes is the 
labeling. The assumption that P{n) is true in "if P{n) is true, then P{n + 1) is true" is usually called 
the induction hypothesis. 

Example 0.3.7: Let us prove that for all n e N we have 

2"-^<nl 

We let P{n) be the statement that 2"~^ < n! is true. By plugging in n = 1, we see that P(l) is true. 

Suppose that P{n) is true. That is, suppose that 2"~^ < n\ holds. Multiply both sides by 2 to 
obtain 

2"<2(n!). 

As2< (n+1) whenneN, wehave2(n!) < (n + l)(n!) = (n + 1)!. That is, 

2"<2(n!) < (n+1)!, 

and hence P{n + 1) is true. By the principle of induction, we see that P{n) is true for all n, and 
hence 2"~^ < n! is true for all n e N. 
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Example 0.3.8: We claim that for all c 7^ 1, we have that 

1 — c""*"^ 

l+c + c^ + --- + c" = 



1-c 

Proof: It is easy to check that the equation holds with n— I. Suppose that it is true for n. Then 
l + c + c^ + --- + c" + c"+^ = il+c + c^ + --- + c")+c"+^ 

1 — c 

_ l-c"+i + (l-c)c''+i 
1 - c"+2 



1-c 



There is an equivalent principle called strong induction. The proof that strong induction is 
equivalent to induction is left as an exercise. 

Theorem 0.3.9 (Principle of strong induction). Let P{n) be a statement depending on a natural 
number n. Suppose that 

(i) (basis statement) P(l) is true, 

( ii) (induction step) ifP{k) is true for allk= 1 , 2, . . . , n, then P{n+l) is true. 
Then P{n) is true for all n&N. 



0.3.3 Functions 

Informally, a set-theoretic function f taking a set A to a set 5 is a mapping that to each x eA 
assigns a unique y eB. We write /: A — > 5. For example, we define a function f:S^T taking 
S = {0, 1,2} to r = {0,2} by assigning /(O) := 2, /(I) := 2, and /(2) := 0. That is, a function 
/ : A — 7> 5 is a black box, into which we stick an element of A and the function spits out an element 
of B. Sometimes / is called a mapping and we say that / maps A to B. 

Often, functions are defined by some sort of formula, however, you should really think of a 
function as just a very big table of values. The subtle issue here is that a single function can have 
several different formulas, all giving the same function. Also, for many functions, there is no 
formula that expresses its values. 

To define a function rigorously first let us define the Cartesian product. 

Definition 0.3.10. Let A and B be sets. Then the Cartesian product is the set of tuples defined as 
follows. 

A x5 := {{x,y) :jc eA,j e5}. 
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For example, the set [0, 1] x [0, 1] is a set in the plane bounded by a square with vertices (0,0), 
(0, 1), (1,0), and (1,1). When A and B are the same set we sometimes use a superscript 2 to denote 
such a product. For example [0, 1]^ = [0, 1] x [0, 1], or = M x M (the Cartesian plane). 

Definition 0.3.11. A function / : A -> is a subset / of A x 5 such that for each x eA, there is a 
unique (x, j) G /. We then write f{x) — y. Sometimes the set / is called the graph of the function 
rather than the function itself. 

The set A is called the domain of / (and sometimes confusingly denoted D{f)). The set 

R{f) :— {y E B : there exists an x such that f{x) —y} 

is called the range of /. 

Note that R{f) can possibly be a proper subset of B, while the domain of / is always equal to A. 
We usually assume that the domain of / is nonempty. 

Example 0.3.12: From calculus, you are most familiar with functions taking real numbers to real 
numbers. However, you have seen some other types of functions as well. For example the derivative 
is a function mapping the set of differentiable functions to the set of all functions. Another example 
is the Laplace transform, which also takes functions to functions. Yet another example is the 
function that takes a continuous function g defined on the interval [0, 1] and returns the number 
Jig{x)dx. 

Definition 0.3.13. Let / : A — > 5 be a function, and C C A. Define the image (or direct image) of C 
as 

/(C) :^{f{x)EB:xEC}. 
Let D dB. Define the inverse image as 

r\D) ■.= {xEA:f{x)ED}. 

Example 0.3.14: Define the function / : R ^ M by f{x) := sin(;r;c). Then /([0, 1/2]) = [0, 1], 
ri({0}) = Z,etc.... 

Proposition 0.3.15. Let f:A^B. Let C, D be subsets ofB. Then 

r\CUD)=f-\C)Uf-\D), 

r\cnD)=f-\c)nf-\D), 

f-\n = {f-\c)Y. 

Read the last line as ^ (5 \ C) = A \ /- ^ (C) . 

Proof. Let us start with the union. Suppose that x E f~^{C[JD). That means that x maps to C or D. 
Thus /-i(CUZ)) C f~HC) ^f~\D). Conversely if x e f~\C), thenx e /-^(CUD). Similarly 
for xEf-^{D). Hence f-\CUD) D f-\C)Uf-\D), and we are have equaUty. 

The rest of the proof is left as an exercise. □ 
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The proposition does not hold for direct images. We do have the following weaker result. 
Proposition 0.3.16. Let f: A^B. Let C,D be subsets of A. Then 

ncuD)=f{c)unD), 

f{cnD)(if{c)nf{D). 

The proof is left as an exercise. 

Definition 0.3.17. Let / : A — )■ 5 be a function. The function / is said to be injective or one-to-one 
if f{x\) — f{x2) implies xi = X2. In other words, f^^{{y}) is empty or consists of a single element 
for all y E B. We call such an / an injection. 

The function / is said to be surjective or onto if /(A) = B. We call such an / a surjection. 

A function / that is both an injection and a surjection is said to be bijective, and we say / is a 
bijection. 

When / : A — > 5 is a bijection, then f~^i{y}) is always a unique element of A, and we can 
consider / ' as a function /^^ : 5 A. In this case, we call the inverse function of /. For 
example, for the bijection f{x) :=x^ we have f^^{x) = y6c. 

A final piece of notation for functions that we need is the composition of functions. 

Definition 0.3.18. Let / : A fi, g : 5 C. The function g o / : A C is defined as 

{gof){x):=g{f{x)). 

0.3.4 Cardinality 

A subtle issue in set theory and one generating a considerable amount of confusion among students is 
that of cardinality, or "size" of sets. The concept of cardinality is important in modem mathematics 
in general and in analysis in particular. In this section, we will see the first really unexpected 
theorem. 

Definition 0.3.19. Let A and B be sets. We say A and B have the same cardinality when there exists 
a bijection f:A—>B. We denote by |A| the equivalence class of all sets with the same cardinality 
as A and we simply call |A| the cardinality of A. 

Note that A has the same cardinality as the empty set if and only if A itself is the empty set. We 
then write |A| := 0. 

Definition 0.3.20. Suppose that A has the same cardinality as { 1 , 2, 3 , . . . , n} for some n e N. We 
then write |A| := n, and we say that A is finite. When A is the empty set, we also call A finite. 
We say that A is infinite or "of infinite cardinality" if A is not finite. 
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That the notation |A| = n is justified we leave as an exercise. That is, for each nonempty finite set 
A, there exists a unique natural number n such that there exists abijection fromA to {1,2,3, . . . 
We can also order sets by size. 

Definition 0.3.21. We write 

|A| < \B\ 

if there exists an injection from A to B. We write |A| = |5| if A and B have the same cardinality. We 
write |A| < |5| if |A| < but A and B do not have the same cardinality. 

We state without proof that |A| = \B\ have the same cardinality if and only if |A| < |5| and 
\B\ < \A\. This is the so-called Cantor-Bernstein-Schroeder theorem. Furthermore, if A and B are 
any two sets, we can always write |A| < \B\ or \B\ < \A\. The issues surrounding this last statement 
are very subtle. As we do not require either of these two statements, we omit proofs. 

The truly interesting cases of cardinality are infinite sets. We start with the following definition. 

Definition 0.3.22. If |A| = |N|, then A is said to be countably infinite. If A is finite or countably 
infinite, then we say A is countable. If A is not countable, then A is said to be uncountable. 

The cardinality of N is usually denoted d& (read as aleph-naught)*. 

Example 0.3.23: The set of even natural numbers has the same cardinality as N. Proof: Given an 
even natural number, write it as In for some n G N. Then create a bijection taking In ion. 

In fact, let us mention without proof the following characterization of infinite sets: A set is 
infinite if and only if it is in one-to-one correspondence with a proper subset of itself. 

Example 0.3.24: N x N is a countably infinite set. Proof: Arrange the elements of N x N as follows 

(1,1), (1,2), (2, 1), (1,3), (2,2), (3, 1), That is, always write down first all the elements whose 

two entries sum to k, then write down all the elements whose entries sum to +1 and so on. Then 
define abijection with N by letting 1 go to (1,1), 2 go to (1,2) and so on. 

Example 0.3.25: The set of rational numbers is countable. Proof: (informal) Follow the same 

procedure as in the previous example, writing i/i, 1/2, 2/1, etc However, leave out any fraction 

(such as 2/2) that has already appeared. 

For completeness we mention the following statement. If A C B and B is countable, then A is 
countable. Similarly if A is uncountable, then B is uncountable. As we will not need this statement 
in the sequel, and as the proof requires the Cantor-Bemstein-Schroeder theorem mentioned above, 
we will not give it here. 

We give the first truly striking result. First, we need a notation for the set of all subsets of a set. 

Definition 0.3.26. If A is a set, we define the power set of A, denoted by ^(A), to be the set of all 

subsets of A. 



*For the fans of the TV show Futurama, there is a movie theater in one episode called an Ko-plex. 
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For example, if A := {1,2}, then ,^(A) = {0,{1},{2},{1,2}}. Note that for a finite set A of 
cardinality n, the cardinality of ^(A) is 2". This fact is left as an exercise. That is, the cardinality 
of ^{A) is strictly larger than the cardinality of A, at least for finite sets. What is an unexpected 
and striking fact is that this statement is still true for infinite sets. 

Theorem 0.3.27 (Cantor*). |A| < \^{A)\. In particular, there exists no surjection from A onto 
^(A). 

Proof. There of course exists an injection / : A — )■ ^(A). For any x E A, define f{x) := {x}. 
Therefore |A| < |^(A)|. 

To finish the proof, we have to show that no function / : A — )■ ^{A) is a surjection. Suppose 
that / : A — 7- ^(A) is a function. So for x eA, f{x) is a subset of A. Define the set 

B:= {xEA:xif{x)}. 

We claim that B is not in the range of / and hence / is not a surjection. Suppose that there exists 
an xq such that /(xq) = B. Either xq E B ox xq ^ B. If xq E B, then xq ^ f{xo) = B, which is a 
contradiction. If xq ^ B, then xq E f{xo) — B, which is again a contradiction. Thus such an xq does 
not exist. Therefore, B is not in the range of /, and / is not a surjection. As / was an arbitrary 
function, no surjection exists. □ 

One particular consequence of this theorem is that there do exist uncountable sets, as ^(N) 
must be uncountable. This fact is related to the fact that the set of real numbers (which we study 
in the next chapter) is uncountable. The existence of uncountable sets may seem unintuitive, and 
the theorem caused quite a controversy at the time it was announced. The theorem not only says 
that uncountable sets exist, but that there in fact exist progressively larger and larger infinite sets N, 
^(N), ^(^(N)), ^{^{^{N))), etc. . . . 

0.3.5 Exercises 

Exercise 0.3.1: Show A \ (B n C) = (A \ B) U (A \ C). 

Exercise 0.3.2: Prove that the principle of strong induction is equivalent to the standard induction. 
Exercise 0.3.3: Finish the proof of Proposition 0.3.15. 
Exercise 0.3.4: a ) Prove Proposition 0.3.16. 

b) Find an example for which equality of sets in f{Cr\D) C /(C) Df^D) fails. That is, find an f. A, B, C, 
and D such that /(COD) is a proper subset of f{C)nf{D). 

Exercise 0.3.5 (Tricky): Prove that if A is finite, then there exists a unique number n such that there exists a 
bijection between A and {1,2,3, ... ,n}. In other words, the notation \A\ := n is justified. Hint: Show that if 
n> m, then there is no injection from {1,2,3, ... ,«} to {1,2,3, .. . ,m}. 



*Named after the German mathematician Georg Ferdinand Ludwig Phihpp Cantor (1845 - 1918). 
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Exercise 0.3.6; Prove 

a) An (sue) = (AnB)U(AnC) 

b) AU(BnC) = (AUS)n(AUC) 

Exercise 0.3.7: LetAAB denote the symmetric difference, that is, the set of all elements that belong to either 
A or B, but not to both A and B. 

a) Draw a Venn diagram for AAB. 

b) Show AAB = {A\B)U{B\A). 

c) Show AAB = {A\JB)\{AnB). 

Exercise 0.3.8: For each n eN, let A„ := {{n + \)k : k eN}. 

a) FindAif\A2. 

b) Find\X=\^n- 

c) Find(X=iA„. 

Exercise 0.3.9: Determine ^{S) (the power set) for each of the following: 



a) 


S = 


0, 


b) 


S = 


{1}, 


c) 


S = 


{1,2}, 


d) 


s = 


{1,2,3 



Exercise 0.3.10: Let f:A^B and g: B be functions. 

a) Prove that ifgof is injective, then f is injective. 

b) Prove that ifg o f is surjective, then g is surjective. 

c) Find an explicit example where go f is bijective, but neither f nor g are bijective. 
Exercise 0.3.11: Prove that n <2" by induction. 

Exercise 0.3.12: Show that for a finite set A of cardinality n, the cardinality of ^{A) is 2". 
Exercise 0.3.13: Prove + j!? H ^ = ^ for all n G N. 

Exercise 0.3.14: Prove +2^ + ---+n^ = (^'^^Y for all n e N. 
Exercise 0.3.15: Prove that n^ + 5n is divisible by 6 for all n G N. 

Exercise 0.3.16: Find the smallest n G N such that 2{n + 5)^ < n^ and call it no- Show that 2{n + 5)^ < n^ 
for all n > no- 

Exercise 0.3.17: Find allneN such that n^ < 2". 
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Exercise 0.3.18; Finish the proof that the principle of induction is equivalent to the well ordering property 
ofN. That is, prove the well ordering property for N using the principle of induction. 

Exercise 0.3.19; Give an example of a countable collection of finite sets Ai,A2, . . ., whose union is not a 
finite set 

Exercise 0.3.20; Give an example of a countable collection of infinite sets Ai,A2, . . ., with Ay HA/, being 
infinite for all j and k, such that 07=1^7 '■^ nonempty and finite. 



INTRODUCTION 



Chapter 1 
Real Numbers 



1.1 Basic properties 

Note: 1.5 lectures 

The main object we work with in analysis is the set of real numbers. As this set is so fundamental, 
often much time is spent on formally constructing the set of real numbers. However, we take an 
easier approach here and just assume that a set with the correct properties exists. We need to start 
with some basic definitions. 

Definition 1.1.1. An ordered set is a set A, together with a relation < such that 

(i) For any x,y eA, exactly one oix <y,x — y,oxy <x holds. 

(ii) If jc < _y and y <z, then x<z. 

For example, the rational numbers Q are an ordered set by letting x<y if and only if 3; —x is a 
positive rational number. Similarly, N and Z are also ordered sets. 

We write Jc<_yifx<};orx = }'. We define > and > in the obvious way. 

Definition 1.1.2. Let E <zA, where A is an ordered set. 

(i) If there exists a Z? G A such that < for all x e £, then we say E is bounded above and b is 
an upper bound of E. 

(ii) If there exists ab eA such that jc > Z? for all xeE, then we say E is bounded below and b is 
a lower bound of E. 

(iii) If there exists an upper bound bo of E such that whenever b is any upper bound for E we have 
bo < b, then bo is called the least upper bound or the supremum of E. We write 

supE :— bo- 



ll 
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(iv) Similarly, if there exists a lower bound bo of E such that whenever b is any lower bound for 
E we have bo > b, then bo is called the greatest lower bound or the infimum of E. We write 

inf E := &o- 

When a set E is both bounded above and bounded below, we say simply that E is bounded. 
Note that a supremum or infimum for E (even if they exist) need not be in For example the 
set {x e Q : X < 1} has a least upper bound of 1, but 1 is not in the set itself. 

Definition 1.1.3. An ordered set A has the least-upper-bound property if every nonempty subset 
E <zA that is bounded above has a least upper bound, that is sup E exists in A. 

The least-upper-bound property is sometimes called the completeness property or the Dedekind 
completeness property. 

Example 1.1.4: For example Q does not have the least-upper-bound property. The set {x G Q : 
< 2} does not have a supremum. The obvious supremum \/2 is not rational. Suppose that — 2 
for some x e Q. Write x = »»/« in lowest terms. So = 2 or = 2n^. Hence m^ is divisible 

by 2 and so m is divisible by 2. We write m = 2k and so we have {2k)^ = 2n^. We divide by 2 and 
note that 2k^ = n^ and hence n is divisible by 2. But that is a contradiction as we said »»/« was in 
lowest terms. 

That Q does not have the least-upper-bound property is one of the most important reasons 

why we work with IR in analysis. The set Q is just fine for algebraists. But analysts require the 
least-upper-bound property to do any work. We also require our real numbers to have many algebraic 
properties. In particular, we require that they are a field. 

Definition 1.1.5. A set F is called afield if it has two operations defined on it, addition x-l-y and 
multiplication xy, and if it satisfies the following axioms. 

(Al) If xeF andy eF, then x + yeF. 

(A2) ( commutativity of addition )lfx + y = y-\-x for all x, j G F . 

(A3) (associativity of addition) If {x-\-y)-\-z = x-\- {y + z) for allx,y,z G F . 

(A4) There exists an element G F such that -I- x = x for all x G F. 

(A5) For every element x G F there exists an element — x G F such that x -I- (— x) = 0. 

(Ml) If X G F and y G F, then xy G F. 

(M2) (commutativity of multiplication) If xy = yx for all x,y G F. 
(M3) (associativity of multiplication) If (xy)z = xijz) for all x,y,z G F. 
(M4) There exists an element 1 (and 1 7^ 0) such that Ix = x for all x G F. 
(M5) For every x E F such that x 7^ there exists an element i/x G F such that x(i/x) = 1. 
(D) ( distributive law j x (y -|- z) = xy -|- xz for all x, y, z G F . 
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Example 1.1.6: The set Q of rational numbers is a field. On the other hand Z is not a field, as it 
does not contain multiplicative inverses. 

We will assume the basic facts about fields that can easily be proved from the axioms. For 
example, Ojc = can be easily proved by noting that xx= {0 + x)x = 0x + xx, using (A4), (D), and 
(M2). Then using (A5) on xx we obtain = Ox. 

Definition 1.1.7. A field F is said to be an ordered field if F is also an ordered set such that: 

(i) For jc,j,z G F,x<y implies x + z < j + z. 

(ii) For x,y E F , x> and y>0 implies xy > 0. 

If jc > 0, we say x is positive. If x < 0, we say x is negative. We also say x is nonnegative if x > 0, 
and X is nonpositive if x < 0. 

Proposition 1.1.8. Let F be an ordered field andx,y,z G F. Then: 

(i) Ifx > 0, then —x<0 (and vice-versa). 

( ii) Ifx > and y < z, then xy < xz. 
( Hi) Ifx < and y < z, then xy > xz. 

(iv) Ifx ^ 0, then x^ > 0. 

(v) IfO<x<y, then < i/y < ^x. 

Note that (iv) implies in particular that 1 > 0. 

Proof. Let us prove (i). The inequality x > implies by item (i) of definition of ordered field 
that x+ (—x) > + {—x). Now apply the algebraic properties of fields to obtain > —x. The 
"vice- versa" follows by similar calculation. 

For (ii), first notice that y < z implies < z — yhy applying item (i) of the definition of ordered 
fields. Now apply item (ii) of the definition of ordered fields to obtain < x{z — y). By algebraic 
properties we get < xz — xy, and again applying item (i) of the definition we obtain xy < xz. 

Part (iii) is left as an exercise. 

To prove part (iv) first suppose that x> 0. Then by item (ii) of the definition of ordered fields we 
obtain that x^ > (use y = x). If x < 0, we use part (iii) of this proposition. Plug my =x and z = 0. 

Finally to prove part (v), notice that i/x cannot be equal to zero (why?). Suppose i/x < 0, then 
-i/j > by (i). Then apply part (ii) (as jc > 0) to obtain x{-^/x) > Ox or —1 > 0, which contradicts 
1 > by using part (i) again. Hence i/x > 0. Similarly !/>■ > 0. Thus (V^)( V)') > by definition of 
ordered field and by part (ii) 

~ mmx<mmy. 

By algebraic properties we get i/v < i/x. □ 

Product of two positive numbers (elements of an ordered field) is positive. However, it is not 
true that if the product is positive, then each of the two factors must be positive. 



24 



CHAPTER 1. REAL NUMBERS 



Proposition 1.1.9. Let x,y G F where F is an ordered field. Suppose that xy > 0. Then either both 
X and y are positive, or both are negative. 

Proof. It is clear that both possibiUties can happen. If either x and y are zero, then xy is zero and 
hence not positive. Hence we can assume that x and y are nonzero, and we simply need to show that 
if they have opposite signs, then xy < 0. Without loss of generality suppose that x> and y <0. 
Multiply y < by .x: to get xy <Qx — 0. The result follows by contrapositive. □ 

1.1.1 Exercises 

Exercise 1.1.1: Prove part (Hi) of Proposition 1.1.8. 

Exercise 1.1.2; Let S be an ordered set Let A C S be a nonempty finite subset. Tiien A is bounded. 
FurtJiermore, inf A exists and is in A and sup A exists and is in A. Hint: Use induction. 

Exercise 1.1.3: Let x,y ^F, witere F is an ordered field. Suppose that Q <x <y. Show that x^ < y^. 

Exercise 1.1.4: Let S be an ordered set. Let B <ZS be bounded (above and below). Let A gB be a nonempty 
subset Suppose that all the inf's and sup's exist. Show that 

inf B < inf A < sup A < sup B. 

Exercise 1.1.5: Let S be an ordered set. Let A C S and suppose that b is an upper bound for A. Suppose that 
b ^ A. Show that b = sup A. 

Exercise 1.1.6: Let S be an ordered set Let Ac S be a nonempty subset that is bounded above. Suppose that 
sup A exists and tliat sup A ^ A. Show that A contains a countably infinite subset In particular, A is infinite. 

Exercise 1.1.7: Find a (nonstandard) ordering of the set of natural numbers N such that there exists a 
nonempty proper subset A C N and such that sup A exists in N, but sup A^A. 

Exercise 1.1.8: Let F = {0, 1 , 2}. a) Prove that there is exactly one way to define addition and multiplication 
so that F is afield ifO and 1 have their usual meaning of(A4) and (M4). b) Show that F cannot be an ordered 
field. 

Exercise 1.1.9: Let S be an ordered set and A is a nonempty subset such that sup A exists. Suppose that 
there is a B C A such that whenever x €z A there is a y €z B such that x <y. Show that sup B exists and 
sup B = sup A. 



1.2. THE SET OF REAL NUMBERS 



25 



1.2 The set of real numbers 

Note: 2 lectures, the extended real numbers are optional 

1.2.1 The set of real numbers 

We finally get to the real number system. Instead of constructing the real number set from the 
rational numbers, we simply state their existence as a theorem without proof. Notice that Q is an 
ordered field. 

Theorem 1.2.1. There exists a unique^ordered field M with the least-upper-bound property such 
that Q C M. ~ 

Note that also N C Q. As we have seen, 1 > 0. By induction (exercise) we can prove that n> 
for all n G N. Similarly we can easily verify all the statements we know about rational numbers and 
their natural ordering. 

Let us prove one of the most basic but useful results about the real numbers. The following 
proposition is essentially how an analyst proves that a number is zero. 

Proposition 1.2.2. Ifx E M is such that x>0 and x < efor all £ eM. where £ > 0, then x = 0. 
Proof. If X > 0, then < ^2 < x (why?). Taking e = ^/i obtains a contradiction. Thus x = 0. □ 

A related simple fact is that any time we have two real numbers a < b, then there is another real 
number c such that a < c < b. Just take for example c = (why?). In fact, there are infinitely 
many real numbers between a and b. 

The most useful property of M for analysts is not just that it is an ordered field, but that it has the 
least-upper-bound property. Essentially we want Q, but we also want to take suprema (and infima) 
willy-nilly. So what we do is to throw in enough numbers to obtain M. 

We have already seen that M must contain elements that are not in Q because of the least-upper- 
bound property. We have seen that there is no rational square root of two. The set {x E Q: x^ <2} 
implies the existence of the real number \/2 that is not rational, although this fact requires a bit of 
work. 

Example 1.2.3: Claim: There exists a unique positive real number r such that = 2. We denote r 
by V2. 

Proof. Take the set A := {x E : x^ < 2} . First if x^ < 2, then x <2. To see this fact, note that 
x>2 implies x^ >4 (use Proposition 1.1.8, we will not explicitly mention its use from now on), 
hence any number x such that jc > 2 is not in A. Thus A is bounded above. On the other hand, 1 G A, 
so A is nonempty. 

* Uniqueness is up to isomorphism, but we wish to avoid excessive use of algebra. For us, it is simply enough to 
assume that a set of real numbers exists. See Rudin [R2] for the construction and more details. 
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Let us define r := sup A. We will show that = 2 by showing that > 2 and <2. This is 
the way analysts show equality, by showing two inequalities. We already know that r > 1 > 0. 

In the following, it may seem that we are pulling certain expressions out of a hat. When writing 
a proof such as this we would of course come up with the expressions only after playing around 
with what we wish to prove. The order in which we write the proof is not necessarily the order in 
which we come up with the proof. 

Let us first show that > 2. Take a positive number s such that < 2. We wish to find an 
h> such that (s + h)^ < 2. As 2 - 5^ > 0, we have > 0. We choose an /i e M such that 
<h < Furthermore, we assume that h<l. 

{s + hf-s^ = h{2s + h) 

<h{2s+l) (since /j<l) 



<2 



o2- 

„2 ■ • ^ 



smce h < 

V 25+ ly 



Therefore, (s + h) < 2. Hence s + h eA, but as /z > we have s + h> s. So s < r = sup A. As s 
was an arbitrary positive number such that < 2, it follows that > 2. 

Now take a positive number s such that > 2. We wish to find an /z > such that {s — h)^ > 2. 
As 5^ - 2 > we have > 0. We choose anheR such that 0<h< and h<s. 

^-{s-hf = lsh-h^ 
< 2sh 



2 . f. , s^-A 

<s —2 since h < — — 

V 25 ; 



By subtracting from both sides and multiplying by —1, we find (s — h) > 2. Therefore s — h^A. 

Furthermore, iix>s — h, then > (s — Kf' > 2 (as x > and s — h>Qi) and so x^A. Thus 
s — /i is an upper bound for A. However, s — h<s,or'm other words s> r= sup A. Thus < 2. 

Together, > 2 and < 2 imply = 2. The existence part is finished. We still need to handle 
uniqueness. Suppose that 5 G M such that = 2 and 5 > 0. Thus = r^. However, if < 5 < r, 
then < r^. Similarly < r < 5 implies <s^. Hence s = r. □ 

The number a/2 ^ Q. The set M \ Q is called the set of irrational numbers. We have just seen 
that M \ Q is nonempty. Not only is it nonempty, we will see later that is it very large indeed. 

Using the same technique as above, we can show that a positive real number x^/" exists for all 
n G N and all x > 0. That is, for each x > 0, there exists a unique positive real number r such that 
r" = X. The proof is left as an exercise. 
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1.2.2 Archimedean property 

As we have seen, there are plenty of real numbers in any interval. But there are also infinitely 
many rational numbers in any interval. The following is one of the fundamental facts about the real 
numbers. The two parts of the next theorem are actually equivalent, even though it may not seem 
like that at first sight. 

Theorem 1.2.4. 

(i) (Archimedean property) Ifx^y E M andx > 0, then there exists an n eN such that 

nx > y. 

( ii) (Q is dense in M) Ifx, y eM. and x <y, then there exists an r eQ such that x < r <y. 

Proof. Let us prove (i). We divide through by x and then (i) says that for any real number t := y/x, 
we can find natural number n such that n> t. In other words, (i) says that N C M is not bounded 
above. Suppose for contradiction that N is bounded above. Let b :— supN. The number b — I 
cannot possibly be an upper bound for N as it is strictly less than b (the supremum). Thus there 
exists an m G N such that m> b — I. We add one to obtain m+l > b, which contradicts b being an 
upper bound. 

Let us tackle (ii). First assume that x>0. Note that y — x>0. By (i), there exists an « G N such 

that 

n{y — x) > 1. 

Also by (i) the set A {fc G N : > nx} is nonempty. By the well ordering property of N, A has a 
least element m. As m E A, then m > nx. As m is the least element of A, m — 1 ^ A. If m > 1, then 
m — 1 G N, but m — 1 ^ A and so m — 1 < ra:. If m = 1, then m — l—0, and m — I <nx still holds 
as .x: > 0. In other words, 

m — 1 <nx<m. 

We divide through by n to get x < On the other hand from n{y — x) > 1 we obtain ny > 1+ nx. 
As njc > m — 1 we get that l+nx> m and hence ny > m and therefore y > "i/n. Putting everything 
together we obtain x < "i/n < y. So let r = '"/«. 

Now assume that x <Q.\fy > 0, then we just take r = 0. If j < 0, then note that < —y < —x 
and find a rational q such that —y<q< —x. Then take r = —q. □ 

Let us state and prove a simple but useful corollary of the Archimedean property. 

Corollary 1.2.5. inf{i/« : n G N} = 0. 

Proof. Let A := {i/n : n G N}. Obviously A is not empty. Furthermore, i/« > and so is a lower 
bound, and b:= 'mfA exists. As is a lower bound, then b>0. Now take an arbitrary a > 0. By the 
Archimedean property there exists an n such that na> 1, or in other words a> ^/nEA. Therefore a 
cannot be a lower bound for A. Hence b = 0. □ 
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1.2.3 Using supremum and infimum 

We want to make sure that suprema and infima are compatible with algebraic operations. For a set 
A c R and a number x define 

x + A:= {x + y eR:y eA}, 
xA := {xy eR:yeA}. 

Proposition 1.2.6. Let A (zMbe bounded and nonempty. 

(i) Ifx e R, then sup(x+A) = x + sup A. 

(ii) Ifx e R, then inf(x+A) =x + inf A. 
(Hi) Ifx > 0, then sup{xA) = x(sup A). 

(iv) Ifx > 0, then inf(xA) = jc(inf A). 

(v) Ifx < 0, then sup(jcA) = jc(inf A). 

(vi) Ifx < 0, then inf(;cA) —x{supA). 

Do note that multiplying a set by a negative number switches supremum for an infimum and 
vice- versa. The proposition also implies that if A is nonempty and bounded then xA and x + A are 
nonempty and bounded. 

Proof. Let us only prove the first statement. The rest are left as exercises. 

Suppose that is an upper bound for A. That is,y<b for ally E A. Then x+y <x + b for all 
y eA, and so jc + Z? is an upper bound for jc + A. In particular, ifb = sup A, then 

sup{x+A) <x + b = x+supA. 

The other direction is similar. If b is an upper bound for x+A, then x + y <b for all j e A and 
soy <b — xfor ally eA. So Z?— x is an upper bound for A. Ifb — sup(x+A), then 

sup A < b — x= sup(x+A) —X. 

And the result follows. □ 

Sometimes we need to apply supremum or infimum twice. Here is an example. 

Proposition 1.2.7. Let A^B GM.be nonempty sets such that x <y whenever x e A and y eB. Then 
A is bounded above, B is bounded below, and sup A < inf 5. 

Proof Any x G A is a lower bound for B. Therefore x < inf B for all x e A, so inf B is an upper 
bound for A. Hence, sup A < inf B. □ 
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We have to be careful about strict inequalities and taking suprema and infima. Note that x <y 
whenever x E A and y E B still only implies sup A < inf B, and not a strict inequality. This is an 
important subtle point that comes up often. For example, take A := {0} and take B := {^/n : n eN}. 
Then < i/n for all n eN. However, sup A = and inf 5 = 0. 

The proof of the following often used elementary fact is left to the reader. A similar statement 
holds for infima. 

Proposition 1.2.8. IfS cM. is a nonempty bounded set, then for every e > there exists xES such 
that sup S — e<x< sup S. 

To make using suprema and infima even easier, we may want to write sup A and inf A without 
worrying about A being bounded and nonempty. We make the following natural definitions. 

Definition 1.2.9. Let A c M be a set. 

(i) If A is empty, then sup A := — oo. 

(ii) If A is not bounded above, then sup A := oo. 

(iii) If A is empty, then inf A : = oo. 

(iv) If A is not bounded below, then inf A := — oo. 

For convenience, oo and — oo are sometimes treated as if they were numbers, except we do not 
allow arbitrary arithmetic with them. We make M* := M U {— oo, oo} into an ordered set by letting 

— oo < oo and — oo < X and x < oo for all x G M. 

The set M* is called the set of extended real numbers. It is possible to define some arithmetic on 
M*. Most operations are extended in an obvious way, but we must leave oo — oo, ■ (±oo), and ^ 
undefined. We refrain from using this arithmetic so as it leads to easy mistakes as M* is not a field. 
Now we can take suprema and infima without fear of emptiness or unboundedness. In this book we 
mostly avoid using M* outside of exercises, and leave such generalizations to the interested reader. 

1.2.4 Maxima and minima 

By Exercise 1.1.2 we know that a finite set of numbers always has a supremum or an infimum that 
is contained in the set itself. In this case we usually do not use the words supremum or infimum. 

When we have a set A of real numbers bounded above, such that sup A G A, then we can use the 
word maximum and the notation maxA to denote the supremum. Similarly for infimum; when a set 
A is bounded below and inf A G A, then we can use the word minimum and the notation min A. For 
example, 

max{l,2.4,:/r,100} = 100, 
min{l,2.4,;r,100} = 1. 

While writing sup and inf may be technically correct in this situation, max and min are generally 
used to emphasize that the supremum or infimum is in the set itself. 
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1.2.5 Exercises 

Exercise 1.2.1; Prove that ift > (t G M.), then there exists an « G N such that <t. 

Exercise 1.2.2: Prove that ift > ff G Mj, then there exists ann gN such that n — I <t <n. 

Exercise 1.2.3: Finish the proof of Proposition 1.2.6. 

Exercise 1.2.4: Let x, y G M. Suppose that x^ = 0. Prove that x = and y = 0. 

Exercise 1.2.5: Show that \/3 is irrational. 

Exercise 1.2.6: Let « G N. Show that either \Jn is either an integer or it is irrational. 

Exercise 1.2.1: Prove the arithmetic-geometric mean inequality. That is, for two positive real numbers x,y 
we have 

^.x+y 

v^< • 

Furthermore, equality occurs if and only ifx = y. 

Exercise 1.2.8: Show that for any two real numbers such that x <y, we have an irrational number s such 

X y 

that X < s <y. Hint: Apply the density o/Q to —j= and —=. 

Exercise 1.2.9: Let A and B be two nonempty bounded sets of real numbers. Let C := {a + b : a £ A, b G B}. 
Show that C is a bounded set and that 

sup C = sup A + sup B and inf C = inf A +inf B. 

Exercise 1.2.10: Let A and B be two nonempty bounded sets of nonnegative real numbers. Define the set 
C '.= {ab : a £ A,b £ B}. Show that C is a bounded set and that 

sup C = (sup A) (sup B) and inf C = (inf A)(inf B). 

Exercise 1.2.11 (Hard): Given x> and « G N, show that there exists a unique positive real number r such 
that X = r". Usually r is denoted by x^l". 
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1.3 Absolute value 

Note: 0.5-1 lecture 

A concept we will encounter over and over is the concept of absolute value. You want to think 
of the absolute value as the "size" of a real number. Let us give a formal definition. 



Let us give the main features of the absolute value as a proposition. 

Proposition 1.3.1. 

(i) \x\ > 0, and \x\ = if and only ifx = 0. 

( ii) I —x\ = \x\ for all jc G M. 
(in) \xy\ = \x\ \y\for all x,y G M. 

I 1 2 7 

(iv) \x\ = X for all X eM.. 

(v) \x\ <y if and only if —y <x <y. 

(vi) — l.^! < jc < \x\ for all x G M. 

Proof, (i): This statement is obvious from the definition. 

(ii) : Suppose that x > 0, then \ —x\ = — (— = x = \x\. Similarly when x < 0, or x = 0. 

(iii) : Ifxory is zero, then the result is obvious. When x and y are both positive, then \x\ \y\ = xy. 
xy is also positive and hence xy = \xy\. Ifx and y are both negative then xy is still positive and xy = 
\xy\, and \x\ \y\ = {—x){—y) = xy. Next assume that x> and y <0. Then \x\ \y\ = x{—y) = —{xy). 
Now xy is negative and hence \xy\ = — {xy). Similarly if x < and y >0. 

(iv) : Obvious if jc > 0. If jc < 0, then \x\^ = {-x)^ = x^. 

(v) : Suppose that \x\ < y. If jc > 0, then x < y. Obviously y > and hence —y<0<xso 
—y^x^y holds. If jc < 0, then \x\ < y means —x < y. Negating both sides we get x > —y. Again 
y >0 and so 3^ > > jc. Hence, —y<x<y. 

On the other hand, suppose that —y<x<yis true. If x > 0, then x <y is equivalent to \x\ < y. 
If X < 0, then —y<x implies {—x) < y, which is equivalent to \x\ < y. 

(vi) : Apply (v) with j = |.)c|. □ 
A property used frequently enough to give it a name is the so-called triangle inequality. 

Proposition 1.3.2 (Triangle Inequality). \x + y\ < \x\ + \y\for allx,y G M. 




X 



if jc>0, 
X iix < 0. 
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Proof. From Proposition 1.3.1 we have — \x\ < x < \x\ and — \y\ < y < \y\. We add these two 
inequalities to obtain 

~{\x\ + \y\)<x + y<\x\ + \y\. 
Again by Proposition 1.3.1 we have that \x + y\ < \x\ + \y\. □ 

There are other versions of the triangle inequality that are applied often. 
Corollary 1.3.3. Let x,y e M 

(i) (reverse triangle inequality) — < — j|. 

(ii) \x-y\ < \x\ + \y\. 

Proof. Let us plug inx = a — b and y = b into the standard triangle inequality to obtain 

\a\ = \a — b + b\ < \a — b\ + \b\, 

or \a\ — \b\ < \a — b\. Switching the roles of a and b we obtain or \b\ — \a\ <\b — a\ = \a — b\. Now 
applying Proposition 1.3.1 again we obtain the reverse triangle inequality. 

The second version of the triangle inequality is obtained from the standard one by just replacing 
y with —J and noting again that |— y| = □ 

Corollary 1.3.4. Let xi,X2: ...,XnER. Then 

\xi +X2-\ \-Xn\ < \xi\ + \X2\ H h \Xn\ ■ 

Proof. We proceed by induction. The conclusion holds trivially for n = I, and for n = 2 it is 
the standard triangle inequality. Suppose that the corollary holds for n. Take n + I numbers 
xi,X2,.. ■ ,Xn+i and first use the standard triangle inequality, then the induction hypothesis 

\xi+X2-\ \-Xn+Xn+l \ < \xi+X2-{ hJCn| + |x„+l| 

< |jCl| + \x2\-\ \-\Xn\ + \Xn+l\. 

□ 

Let us see an example of the use of the triangle inequality. 

Example 1.3.5: Find a number M such that — 9a: + 1 1 < M for all — 1 <x<5. 
Using the triangle inequality, write 

\x^-9x+l\ < |jc2| + |9jc| + |1| = \x\^ + 9\x\ + l. 

It is obvious that + 9|x| + 1 is largest when \x\ is largest. In the interval provided, \x\ is largest 
when x = 5 and so \x\ =5. One possibility for M is 

M = 52 + 9(5) + l =71. 

There are, of course, other M that work. The bound of 71 is much higher than it need be, but we 
didn't ask for the best possible M, just one that works. 
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The last example leads us to the concept of bounded functions. 

Definition 1.3.6. Suppose / : D — )■ M is a function. We say / is bounded if there exists a number 
M such that |/(jc) | < M for all x^D. 

In the example we have shown that — 9x + 1 is bounded when considered as a function on 
D = {x : — \ <x<5}. On the other hand, if we consider the same polynomial as a function on the 
whole real line M, then it is not bounded. 

For a function / : D — t- M we write 

sup/(x) := sup /(D), 
inf/W:=inf/(D). 

To illustrate some common issues, let us prove the following proposition. 
Proposition 1.3.7. If f: Z) — )• M and g: D — t- M (D nonempty) are bounded^functions and 

/W < g{^) for all X E D, 

then 

supf{x)<supg{x) and inf f{x) < inf g{x). (1.1) 

xeD xeD x<ED 

You should be careful with the variables. The x on the left side of the inequality in 1.1 is different 
from the x on the right. You should really think of the first inequality as 

sup/W < sup^(y). 

xeD yeD 

Let us prove this inequality. If b is an upper bound for g{D), then f{x) < g{x) < b for all x E D, 
and hence b is an upper bound for f{D) . Taking the least upper bound we get that for aWx ED 

f{x) < sup^(3;). 

Therefore sup^,^j^g{y) is an upper bound for f{D) and thus greater than or equal to the least upper 
bound of /(D). 

sup/(x) < supg(j). 

xeD yeD 

The second inequality (the statement about the inf) is left as an exercise. 
Do note that a common mistake is to conclude that 

snpfix) < inf g{y). (1.2) 

xeD 



*The boundedness hypothesis is for simpHcity, it can be dropped if we allow for the extended real numbers. 
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The inequality (1.2) is not true given the hypothesis of the claim above. For this stronger inequality 
we need the stronger hypothesis 

f{x) < g{y) for aWxeD and yeD. 

The proof is left as an exercise. 

1.3.1 Exercises 

Exercise 1.3.1; Show that \x — y\ <e if and only ifx — £<y<x + £. 

Exercise 1.3.2; Show that 

a) max|x,3'j = — --^ — - 

b) rmn{x,y} = ^^^^^^^^^ 

Exercise 1.3.3; Find a number M such that \x^ —x^ + 8;c| < M for all —2 <x<\0. 

Exercise 1.3.4; Finish the proof of Proposition 1.3. 7. That is, prove that given any set D, and two bounded 
functions / : D — )• R and g : D — )• M such that f{x) < g{x) for all x£ D, then 

inf fix) < inf g(x). 
xeD ^ ' ~ xeD^ ' 

Exercise 1.3.5: Let f: D^Randg: D ^ R be functions (D nonempty). 

a) Suppose that f[x) < g(y) for all x £ D and y £ D. Show that 

supf{x) < Mg{x). 

xeD 

b) Find a specific D, f, and g, such that f[x) < g{x)for all x £ D, but 

sup/(x) > inf ^(x). 
x€D Jce^ 

Exercise 1.3.6; Prove Proposition L 3. 7 without the assumption that the functions are bounded. Hint: You 
need to use the extended real numbers. 

Exercise 1.3.7; Let D be a nonempty set. Suppose that f:D^M and g: D ^M. are functions, a) Show that 
sup(/W < sup/(x) +supg(x) and inf (/(x) +g{x)) > inf /(x) + inf ^(x). 

xeD xeD xeD xeD xeD 

b) Find examples where we obtain strict inequalities. 
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1.4 Intervals and the size of M 

Note: 0.5-1 lecture (proof of uncountability o/M can be optional) 

You have seen the notation for intervals before, but let us give a formal definition here. For 
a,b eR such that a < bwe define 

[a,b]:= {xeR:a<x< £>}, 
{a,b) := {x eR : a < X < b}, 
{a,b] := {xeR: a <x<b}, 
[a,b) •.= {xeR:a<x<b}. 

The interval [a, b] is called a closed interval and (a, b) is called an open interval. The intervals of 
the form {a,b\ and [a,b) are called half-open intervals. 

The above intervals were all bounded intervals, since both a and b were real numbers. We define 
unbounded intervals, 

[a,oo) := {jc G M : a < x}, 
(a, oo) := {jc G M : a < x}, 
(-oo,^] ■.= {xeR:x<b}, 
(-oo,^?) := {xgM:x<Z?}. 

For completeness we define (— oo^oo) ;= ]R. 

In short, an interval is a set / C M with at least 2 elements, such that ii a <b < c and a,c E I, 
then b E I. See Exercise 1.4.3. 

We have already seen that any open interval {a,b) (where a < b of course) must be nonempty. 
For example, it contains the number An unexpected fact is that from a set-theoretic perspective, 
all intervals have the same "size," that is, they all have the same cardinality. For example the map 
f{x) := 2x takes the interval [0, 1] bijectively to the interval [0,2]. 

Maybe more interestingly, the function f{x) := tan(x) is a bijective map from to 
M. Hence the bounded interval (—71,71) has the same cardinality as M. It is not completely 
straightforward to construct a bijective map from [0, 1] to say (0, 1), but it is possible. 

And do not worry, there does exist a way to measure the "size" of subsets of real numbers that 
"sees" the difference between [0, 1] and [0,2]. However, its proper definition requires much more 
machinery than we have right now. 

Let us say more about the cardinality of intervals and hence about the cardinality of M. We 
have seen that there exist irrational numbers, that is M \ Q is nonempty. The question is: How 
many irrational numbers are there? It turns out there are a lot more irrational numbers than rational 
numbers. We have seen that Q is countable, and we will show that M is uncountable. In fact, the 
cardinality of M is the same as the cardinality of ^(N), although we will not prove this claim here. 

Theorem 1.4.1 (Cantor). M is uncountable. 
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We give a modified version of Cantor's original proof from 1874 as this proof requires the least 
setup. Normally this proof is stated as a contradiction proof, but a proof by contrapositive is easier 
to understand. 

Proof. Let X C M be a countably infinite subset such that for any two real numbers a <b, there is 
anx eX such that a < x <b. Were M countable, then we could take X = M. If we show that X is 
necessarily a proper subset, then X cannot equal M, and M must be uncountable. 

As X is countably infinite, there is a bijection from N to X. Consequently, we write X as a 
sequence of real numbers xi^xj^x-},,. . such that each number in X is given by Xj for some j E N. 

Let us inductively construct two sequences of real numbers ai,a2,a2,, . . . and Z?i,Z?2,^3, Let 

ai := Xi and bi := xi + 1. Note that ai < bi and xi ^ {a\,b\). For k> \, suppose that a^-i 
and bk-i has been defined. Let us also suppose that {ak-i,bk_\) does not contain any xj for any 
j = l,...,fc-l. 

(i) Define ak := Xj, where j is the smallest 7 G N such ihdXxj E {ak-i,bk-i)- Such an Xj exists 
by our assumption on X. 

(ii) Next, define bk '■= xj where j is the smallest j eN such that xj E (ayt, bk-i). 

Notice that Uk < bk and a^t-i < ak < bk < bk-i- Also notice that {ak,bk) does not contain .Jc^t and 
hence does not contain any Xj for j = 1 , . . . , fc. 

Claim: Uj < bk for all j and k in N. Let us first assume that j < k. Then aj < aj+i < ■ ■ ■ < 
Uk-i < ak < bk- Similarly for j > k. The claim follows. 

Let A = {aj : 7 G N} and 5 = {bj : j G N}. By Proposition 1.2.7 and the claim above we have 

sup A < inf B. 

Define y := sup A. The number y cannot be a member of A. If y = aj for some j, then y < a^+i, 
which is impossible. Similarly y cannot be a member of B. Therefore, aj < y for all j G N and 
y < bj for all j E N. In other words y E {aj, bj) for all j E N. 

Finally we have to show that y ^X. If we do so, then we have constructed a real number not 
in X showing that X must have been a proper subset. Take any Xk EX. By the above construction 
Xk i (fityt, bk) , SO Xk ^ y as yE{ak,bk). 

Therefore, the sequence xi^X2t ■ ■ cannot contain all elements of M and thus M is uncountable. 

□ 

1.4.1 Exercises 

Exercise 1.4.1: For a <b, construct an explicit bijection from {a,b] to (0, 1]. 

Exercise 1.4.2: Suppose f : [0, 1] — )■ (0, 1) is a bijection. Using f, construct a bijection from [—1,1] to M. 

Exercise 1.4.3: Suppose that I cM is a subset with at least 2 elements such that if a < b < c and a,c £ I, 
then it is one of the nine types of intervals explicitly given in this section. Furthermore, prove that the intervals 
given in this section all satisfy this property. 
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Exercise 1.4.4 (Hard); Construct an explicit bijectionfrom (0, 1] to (0, 1). Hint: One approach is as follows: 

First map (1/2, 1] to (0, 1/2], then map (1/4, 1/2] to (1/2,3/4], etc Write down the map explicitly, that is, write 

down an algorithm that tells you exactly what number goes where. Then prove that the map is a bijection. 

Exercise 1.4.5 (Hard); Construct an explicit bijectionfrom [0, 1] to (0, 1). 

Exercise 1.4.6; a) Show that every closed interval [a,b\ is the intersection of countably many open intervals, 
b) Show that every open interval [a.h) is a countable union of closed intervals, c) Show that a possible 
infinite intersection of closed intervals is either empty, a single point, or a closed interval. 

Exercise 1.4.7; Suppose that S is a set of disjoint open intervals in R. That is, if {a,b) € S and {c,d) € S, 
then either {a,b) = {c,d) or {a,b) Ci {c,d) = 0. Prove that S is a countable set. 
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1.5 Decimal representation of the reals 

Note: 1 lecture (optional) 

We often think of real numbers as their decimal representation. For a positive integer n, we find 
the digits dK^dg-i,. . .,d2,d\,dQ for some K, where each dj is an integer between and 9, then 

n = dK\Q'^ + dK-\\0'^~^ + --- + d2\0^ + di\Q) + dQ. 

We often assume dx ^ 0. To represent n we write the sequence of digits: n = dxdK-i ■ ■ ■ d2d\dQ. 
By a (decimal) digit, we mean an integer between and 9. 

Similarly we can represent some rational numbers. That is, for certain numbers x, we can find 
negative integer — M, a positive integer K, and digits dx^ dx- 1 , . . . , <ii , Jo? ^- b • • • , d-M, such that 



X = dKlO^ + dK-i 10^-^ + --- + d2lO^ + dilO + do + d-ilO-^+ d-2l0~^ + ■■■+ J_m10" 



-M 



We write x = dgdg-i ■ ■ ■ dido . d-\d-2 ■ ■ ■ d-M- 

Not every real number has such a representation, even the simple rational number 1/3 does not. 
The irrational number ^/l does not have such a representation either. To get a representation for all 
real numbers we must allow infinitely many digits. 

Let us from now on consider only real numbers in the interval (0, 1]. If we find a representation 
for these, we simply add integers to them to obtain a representation for all real numbers. Suppose 
that we take an infinite sequence of decimal digits: 

0.did2dj, 

That is, we have a digit dj for every j G N. We have renumbered the digits to avoid the negative 
signs. We say that this sequence of digits represents a real number x if 



d\ do di, dy, 

— H — -{ — A \ — 

10 ^ 102 103 10« 



We call 



d\ d2 d^ dn 

io + Io^ + Io3 + '" + To^ 

the truncation of x to n decimal digits. 
Proposition 1.5.1. 

(i) Every infinite sequence of digits 0.did2d3 . . . represents a unique real number x e [0, 1]. 

(ii) For every x G (0, 1] there exists an infinite sequence of digits Q.d\d2d2, . . . that represents x. 
There exists a unique representation such that 

Dn<x<Dn + ^ forallneN. 
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Proof. Let us start with the first item. Suppose we have an infinite sequence of digits 0.did2d3 

We use the geometric sum formula to write 

^ di d2 ds dn 9 9 9 9 

n = _ J —J —J I iL < I I I I 

" 10 ^ 102 103 10" - 10 ^ 102 103 10" 

= ^(i+vio+(Vio)'+---+(vior^) 

In particular, D„ < 1 for all n. As Z)„ > is obvious, we obtain 

< supD„ < 1, 

and therefore 0. Ji J2'^3 ■ ■ • represents a unique number x E [0, 1] . 

We move on to the second item. Take x E (0, 1]. First let us tackle the existence. By convention 
define Dq := 0, then automatically we obtain Dq < x < Do+ 10 Suppose for induction that we 
defined all the digits di,d2, ■ ■ ■ ,dn, and that D„ < x < Dn + 10 We need to define dn+i . 

By the Archimedean property of the real numbers we find an integer j such that x — Dn < 
yj^Q-(n+i) ^g^Q (^j^g least such j and obtain that 

(j- l)10-("+i) < Jc-A, < jlO-("+i). 

Let dn+i '.= J — 1. As D„ < X, then J^+i = j — 1 > 0. On the other hand since x — D,i < 10^" we 
have that j can be at most 10, and therefore J„+i < 9. So dn+i is a decimal digit. Furthermore 
since D^+i = Z)„ + 10~("+^) we obtain D„+i < jc < D^+i + 10"("+^\ We have inductively 

defined an infinite sequence of digits O.Ji ^2^3 As D^ < x for all n, then sup{D„ : n G N} < x. 

As .x: — 10^" < Dn, then x — 10~" < sup{D,n : m G N} for all n. The two inequahties imply sup{D„ : 
nEN} =x. 

What is left to show is the uniqueness. Suppose that 0.eie2^3 ■ ■ ■ is another representation of x, 
let En be the n-digit truncation of 0.eie2^3 ■ • and suppose that En < x < En + 10 " for all n eN. 
Suppose that for some K eN, e„ = dn for all n < K,so Dk \ = Ek-\. Then 

Ek = Dk-i+ckIO-'^ <x<Ek + 10-'^ = DK-i+exlO''^ + 10"^. 

Subtracting D^-i and multiplying by 10^ we get 

eK<{x-DK-i)lO'^ <eK + l. 

Similarly we obtain 

dK< {x-Dk-i)10^ <dK+l. 

That means that both ex and dx are the largest integer j such that j < {x — Dk-i)10^ , and therefore 
= dx- That is, the representation is unique. □ 
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The representation is not unique if we do not require the extra condition in the proposition. For 
example, for the number 1/2 the method in the proof obtains the representation 

0.49999.... 

However, we also have the representation 0.5000. . .. The key requirement that makes the represen- 
tation unique is D„ < x for all n. The inequality a: < Z)„ + 10~" is true for every representation by 
the computation in the beginning of the proof. 

The only numbers that have nonunique representations are ones that end either in an infinite 
sequence of Os or 9s. That is because the only representation for which D„ = x is one where all 
digits past nth one are zero. In this case there are exactly two representations of x (see the exercises). 

Let us give another proof of the uncountability of the reals using decimal representations. This 
is Cantor's second proof, and is probably more well known. While this proof may seem shorter, it is 
because we have already done the hard part above and we are left with a slick trick to prove that M 
is uncountable. This trick is called Cantor diagonalization and finds use in other proofs as well. 

Theorem 1.5.2 (Cantor). The set (0, 1] is uncountable. 

Proof. LetZ := {xi,X2,X2,, . . .} be any countable subset of real numbers in (0, 1]. We will construct 
a real number not in X. Let 

be the unique representation from the proposition, that is J" is the 7th digit of the nth number. Let 
Cn := 1 if J" 7^ 1, and let e„ := 2 if = 1. Let En be the n-digit truncation of 3; = O.eiez^s • • •• 
Because all the digits are nonzero we get that En < En+i < y. Therefore 

En<y<En+lO-'' 

for all n, and the representation is the unique via the proposition. But for every n, the nth digit of y 
is different from the nth digit of Xn, so y Xn. Therefore y ^X, and as X was an arbitrary countable 
subset, (0, 1] must be uncountable. □ 

Using decimal digits we can also find lots of numbers that are not rational. The following 
proposition is true for every rational number, but we give it only for x G (0, 1] for simplicity. 

Proposition 1.5.3. Ifx G (0, 1] is a rational number and x = Q.didzdj, . . ., then the decimal digits 
eventually start repeating. That is, there are positive integers N and P, such that for all n>N, 

dn = dn+p. 

Proof. Let x — p/q for positive integers p and q. Let us suppose that x is one with a unique 
representation, as otherwise we have seen above that both its representations are repeating. 

To compute the first digit we take lOp and divide by q. The quotient is the first digit di and 
the remainder r is some integer between and q—l. That is, di is the largest integer such that 
diq < lOp and then r = lOp — d\q. 
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The next digit is computed by dividing lOr by q, and so on. We notice that at each step there are 
at most q possible remainders and hence at some point the process must start repeating. In fact we 
see that P is at most q. □ 

The converse of the proposition is also true and is left as an exercise. 

Example 1.5.4: The number 

X = 0.101001000100001000001 . . . , 

is irrational. That is, the digits are n zeros, then a one, then n+l zeros, then a one, and so on and so 
forth. The fact that x is irrational follows from the proposition, the digits never start repeating. For 
every P, if we go far enough, we find a 1 that is followed by at least P+l zeros. 

1.5.1 Exercises 

Exercise 1.5.1 (Easy); What is the decimal representation of 1 guaranteed by Proposition 1.5.1 ? Make sure 
to show that it does satisfy the condition. 

Exercise 1.5.2; Prove the converse of Proposition 1.5.3, that is, if the digits in the decimal representation of 
X are eventually repeating, then x must be rational. 

Exercise 1.5.3; Show that real numbers x G (0, 1) with nonunique decimal representation are exactly the 
rational numbers that can be written as for some integers m and n. In this case show that there exist 
exactly two representations ofx. 

Exercise 1.5.4; Let b>2 be an integer Define a representation of a real number in [0, 1] in terms of base b 
rather than base 10 and prove Proposition 1.5.1 for base b. 

Exercise 1.5.5; Using the previous exercise with b = 2 (binary), show that cardinality ofM is the same as 
the cardinality o/^(N), obtaining yet another (though related) proof that M is uncountable. Hint: Construct 
two injections, one from [0, 1] to ^(N) and one from ^3^{H) to [0, 1]. Hint 2: Given a set A C N, let the nth 
binary digit ofx be 1 ifn^A. 

Exercise 1.5.6; Construct a bijection between [0, 1] and [0, 1] x [0, 1]. Hint: consider even and odd digits, 
and be careful about the uniqueness of representation. 
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Chapter 2 

Sequences and Series 



2.1 Sequences and limits 

Note: 2.5 lectures 

Analysis is essentially about taking limits. The most basic type of a limit is a limit of a sequence 
of real numbers. We have already seen sequences used informally. Let us give the formal definition. 

Definition 2.1.1. A sequence (of real numbers) is a function .jc: N — )• M. Instead of x{n) we usually 
denote the nth element in the sequence by Xn- We use the notation {x„}, or more precisely 

{•^w}n=l5 

to denote a sequence. 

A sequence {xn} is bounded if there exists a 5 G M such that 

\xn\ < B for all n G N. 

In other words, the sequence {xn} is bounded whenever the set {xn : n G N} is bounded. 

When we need to give a concrete sequence we often give each term as a formula in terms of n. 
For example, or simply {V«}> stands for the sequence 1, 1/2, 1/3, 1/4, 1/5, . . .. The sequence 

{i/n} is a bounded sequence (5=1 will suffice). On the other hand the sequence {n} stands for 
1,2, 3,4, . . ., and this sequence is not bounded (why?). 

While the notation for a sequence is similar* to that of a set, the notions are distinct. For 
example, the sequence {( — 1)"} is the sequence —1,1,— 1,1,— 1,1,..., whereas the set of values, 
the range of the sequence, is just the set { — 1,1}. We can write this set as {(—1)" : n G N}. When 
ambiguity can arise, we use the words sequence or set to distinguish the two concepts. 

Another example of a sequence is the so-called constant sequence. That is a sequence {c} = 
c,c,CjC, . . . consisting of a single constant c G M repeating indefinitely. 

*[BS] use the notation (jc,,) to denote a sequence instead of {x,,}, which is what [R2] uses. Both are common. 
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We now get to the idea of a limit of a sequence. We will see in Proposition 2. 1 .6 that the notation 
below is well defined. That is, if a limit exists, then it is unique. So it makes sense to talk about the 
limit of a sequence. 

Definition 2.1.2. A sequence {x„} is said to converge to a number .x; G M, if for every e > 0, there 
exists an M G N such that |jc„ —x\<e for all n>M. The number x is said to be the limit of {jc„}. 
We write 



lim Xfi :=x. 



A sequence that converges is said to be convergent. Otherwise, the sequence is said to be 
divergent. 

It is good to know intuitively what a limit means. It means that eventually every number in the 
sequence is close to the number x. More precisely, we can be arbitrarily close to the limit, provided 
we go far enough in the sequence. It does not mean we can ever reach the limit. It is possible, and 
quite common, that there is no jc„ in the sequence that equals the limit x. 

When we write limx,i = x for some real number x, we are saying two things. First, that {xn} is 
convergent, and second that the limit is x. 

The above definition is one of the most important definitions in analysis, and it is necessary to 
understand it perfectly. The key point in the definition is that given any e > 0, we can find an M. 
The M can depend on £, so we only pick an M once we know £. Let us illustrate this concept on a 
few examples. 

Example 2.1.3: The constant sequence 1,1,1,1,... is convergent and the limit is 1. For every 
e > 0, we pickM = 1. 

Example 2.1.4: Claim: The sequence { V«} is convergent and 

Hm - = 0. 

Proof: Given an e > 0, we find an M G N such that < i/m < e (Archimedean property at work). 
Then for all n > M we have that 



\x„-0\ 



1 1 

n M 



Example 2.1.5: The sequence {(—1)"} is divergent. Proof: If there were a hmit jc, then for £ = ^ 
we expect an M that satisfies the definition. Suppose such an M exists, then for an even n>M we 
compute 

1/2 > |.)c„ — x| = 1 1 — x| and y2> \xn+i — x\ = \ — l — x\ . 

But 

2= |l-jc-(-l-x)| < |l-jc| + |-l-x| < 1/2+1/2=1, 
and that is a contradiction. 
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Proposition 2.1.6. A convergent sequence has a unique limit. 

The proof of this proposition exhibits a useful technique in analysis. Many proofs follow the 
same general scheme. We want to show a certain quantity is zero. We write the quantity using the 
triangle inequality as two quantities, and we estimate each one by arbitrarily small numbers. 

Proof. Suppose that the sequence {x„} has the limit x and the limit y. Take an arbitrary e > 0. 

From the definition we find an Mi such that for all n>M\, |x„ — jc| < e/2. Similarly we find an M2 
such that for all n > M2 we have |jc„ — < e/2. Take M := max{Mi,M2}. For n > M (so that both 
n>Mi and n > M2) we have 

\y-x\ = \x„-x-{xn-y)\ 

< \Xn -x\ + \Xn -y\ 

£ e 

As jy — x| < e for all e > 0, then \y—x\ =0 and y = x. Hence the limit (if it exists) is unique. □ 
Proposition 2.1.7. A convergent sequence is bounded. 

Proof. Suppose that {jc„} converges to x. Thus there exists an M G N such that for all n > M we 
have |x„ — x| < 1. Let Bi := |x| + 1 and note that for « > M we have 

I — |-^/| I ^1 

< 1 + |jc| 

The set { |xi I , |x2 1 , . . . , \xm- 1 | } is a finite set and hence let 

B2 :=max{|xi|,|x2|,...,|xM-i|}. 
Let B := max{fii,B2}- Then for all n e N we have 

\Xn\ <B. 

□ 

The sequence {(—1)"} shows that the converse does not hold. A bounded sequence is not 
necessarily convergent. 

Example 2.1.8: Let us show the sequence | ^^^^ | converges and 

r "' + 1 1 
lim = 1. 
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Given e > 0, find M G N such that < e. Then for any n>M we have 



rP- + n 



+ n 



1 



n 



< 
< 



n'^ + n 
n-l 

+ n 
n 



1 



rP- + n /7+ 1 
1 

< £. 



M+1 



Therefore, lim - 



-1 



1. 



2.1.1 Monotone sequences 

The simplest type of a sequence is a monotone sequence. Checking that a monotone sequence 
converges is as easy as checking that it is bounded. It is also easy to find the limit for a convergent 
monotone sequence, provided we can find the supremum or infimum of a countable set of numbers. 

Definition 2.1.9. A sequence {jc„} is monotone increasing if x„ < x„^\ for all n G N. A sequence 
{xn} is monotone decreasing if Xn > Xn+i for all n G N. If a sequence is either monotone increasing 
or monotone decreasing, we simply say the sequence is monotone. Some authors also use the word 
monotonic. 

Theorem 2.1.10. A monotone sequence {x,,} z^' bounded if and only if it is convergent. 
Furthermore, if {xn} is monotone increasing and bounded, then 

lim Xn = sup{x„ : n G N}. 

If {xn} is monotone decreasing and bounded, then 

lim Xn = inf{jc„ : n G N}. 

Proof. Let us suppose that the sequence is monotone increasing. Suppose that the sequence is 
bounded. That means that there exists a B such that x„<B for all n, that is the set {xn : n G N} is 
bounded from above. Let 

X := sup{x„ : n G N}. 

Let e > be arbitrary. As x is the supremum, then there must be at least one M G N such that 
xm > X — £ (because x is the supremum). As is monotone increasing, then it is easy to see (by 
induction) that Xn > xm for all n>M. Hence 



X — x„ < X — xm < £■ 
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Therefore the sequence converges to x. We aheady know that a convergent sequence is bounded, 
which completes the other direction of the implication. 

The proof for monotone decreasing sequences is left as an exercise. □ 

Example 2.1.11: Take the sequence {^}- 

First we note that and hence the sequence is bounded from below. Let us show that it 

is monotone decreasing. We start with y/n+l > i/n (why is that true?). From this inequality we 
obtain 

1 ^ 1 

So the sequence is monotone decreasing and bounded from below (hence bounded). We apply the 
theorem to note that the sequence is convergent and that in fact 

lim = inf < ^ : /7 G N 



We already know that the infimum is greater than or equal to 0, as is a lower bound. Take a number 
> such that < ^ for all n. We square both sides to obtain 



< - for all n G N. 
n 



We have seen before that this implies that Z?^ < (a consequence of the Archimedean property). 
As we also have > 0, then = and so b = 0. Hence b = is the greatest lower bound, and 
lim^ = 0. 



/n 



Example 2.1.12: A word of caution: We have to show that a monotone sequence is bounded 

in order to use Theorem 2.1.10. For example, the sequence {1 + i/2H h i/n} is a monotone 

increasing sequence that grows very slowly. We will see, once we get to series, that this sequence 
has no upper bound and so does not converge. It is not at all obvious that this sequence has no upper 
bound. 

A common example of where monotone sequences arise is the following proposition. The proof 
is left as an exercise. 

Proposition 2.1.13. Let S GM.be a nonempty bounded set. Then there exist monotone sequences 
{xn} and {jn} such that Xn,yn G S and 



sup5=lim.x:„ and inf5=limy„. 
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2.1.2 Tail of a sequence 

Definition 2.1.14. For a sequence {xn}, the K-tail (where K eN) or just the tail of the sequence is 
the sequence starting at ^+ 1, usually written as 

{Xn+K}'^=l or {x„}'^^]^_^_i. 

The main result about the tail of a sequence is the following proposition. 

Proposition 2.1.15. For any K & N, the sequence {xn}'^^i converges if and only if the K-tail 
{xn+K}°^=\ converges. Furthermore, if the limit exists, then 

lim Xn = lim Xn+K- 

n— >cxj n— >oo 

Proof. Define y„ := Xn+K- We wish to show that {jc„} converges if and only if converges. 
Furthermore we wish to show that the limits are equal. 

Suppose that {xn} converges to some x G M. That is, given an e > 0, there exists an M G N such 
that |x — I < £ for all n>M. Note that n>M impUes n + K>M. Therefore, it is true that for all 
n>M we have that 

\x-yn\ = \x-x„+k\ < e. 

Therefore {j„} converges to x. 

Now suppose that {j„} converges to jc G M. That is, given an e > 0, there exists an M' G N such 
that \x-yn\ <e for all n > M' . Let M := M' + K. Then n>M impUes that n-K>M'. Thus, 
whenever n>MwQ have 

\x-Xn\ = \x-yn-K\ < £■ 

Therefore {x„} converges to x. □ 

Essentially, the limit does not care about how the sequence begins, it only cares about the tail of 
the sequence. That is, the beginning of the sequence may be arbitrary. 

2.1.3 Subsequences 

A very useful concept related to sequences is that of a subsequence. A subsequence of {jc„} is a 
sequence that contains only some of the numbers from {x„} in the same order. 

Definition 2.1.16. Let {xn} be a sequence. Let {n;} be a strictly increasing sequence of natural 
numbers (that is ni < n2 < n^ < ■ ■ ■). The sequence 

{Xni}'u=l 

is called a subsequence of {xn}. 
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For example, take the sequence {i/n}. The sequence {1/3"} is a subsequence. To see how these 
two sequences fit in the definition, take := 3z. Note that the numbers in the subsequence must 
come from the original sequence, so 1,0, 1/3, 0, V^? • • • is not a subsequence of {V"}- Similarly order 
must be preserved, so the sequence 1, 1/3, 1/2, 1/5, ... is not a subsequence of {V"}- 

Note that a tail of a sequence is one type of a subsequence. For an arbitrary subsequence, we 
have the following proposition. 

Proposition 2.1.17. If {x„} is a convergent sequence, then any subsequence {x„.} is also convergent 
and 

\imxn = limxni- 

Proof. Suppose that hm„_j.oo jc„ = x. That means that for every e > we have an M G N such that 
for all n > M 

I -^/7 I £ • 

It is not hard to prove (do it!) by induction that n, > /. Hence / > M implies that > M. Thus, for 
all i>M we have 

\xiii x\ ^ £, 

and we are done. □ 

Example 2.1.18: Existence of a convergent subsequence does not imply convergence of the se- 
quence itself. Take the sequence 0, 1 , 0, 1 , 0, 1 , . . .. That is x„ = if n is odd, and x„ = 1 if n is even. 
It is not hard to see that {xn} is divergent, however, the subsequence {x2n} converges to 1 and the 
subsequence {x2n+i} converges to 0. Compare Theorem 2.3.7. 

2.1.4 Exercises 

In the following exercises, feel free to use what you know from calculus to find the limit, if it exists. But you 
must prove that you have found the correct limit, or prove that the series is divergent. 

Exercise 2.1.1: Is the sequence {in} bounded? Prove or disprove. 

Exercise 2.1.2; Is the sequence {«} convergent? If so, what is the limit. 

Exercise 2.1.3: Is the sequence { > convergent? If so, what is the limit. 

I 2?i J 

Exercise 2.1.4: 7^' tJie sequence {2^"} convergent? If so, what is the limit 

r n \ 

Exercise 2.1.5: 7^ the sequence { > convergent? If so, what is the limit. 

+ l J 

r n 1 

Exercise 2.1.6: 7^ the sequence { > convergent? If so, what is the limit. 

\n^+l 
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Exercise 2.1.7; Let {x„} be a sequence. 

a) Show that limx„ = (that is, the limit exists and is zero) if and only ifYim\xn\ = 0. 

b) Find an example such that {|x„|} converges and {jc„} diverges. 

r 2" 1 

Exercise 2.1.8; 7^' the sequence { — > convergent? If so, what is the limit. 

l«! J 

Exercise 2.1.9; Show that the sequence < — = I is monotone, bounded, and use Theorem 2.1.10 to find the 



limit. 

Exercise 2.1.10; Show that the sequence | — J> is monotone, bounded, and use Theorem 2.1.10 to find 



the limit. 

Exercise 2.1.11; Finish the proof of Theorem 2.1.10 for monotone decreasing sequences. 
Exercise 2.1.12; Prove Proposition 2.1.13. 

Exercise 2.1.13; Let {x„} be a convergent monotone sequence. Suppose that there exists a ^ € N such that 

lim x„ = Xk- 

Show that x„ = Xf. for all n>k. 

Exercise 2.1.14; Find a convergent subsequence of the sequence {(—1)"}. 
Exercise 2.1.15; Let {x„} be a sequence defined by 



Xfi .- 



n ifn is odd., 
i/« ifn is even. 



a) Is the sequence bounded? (prove or disprove) 

b) Is there a convergent subsequence? If so, find it. 

Exercise 2.1.16; Let {x„} be a sequence. Suppose that there are two convergent subsequences {x,,, } and 
{xm,}. Suppose that 

\imxnj=a and limx,„,=^, 
where a^b. Prove that {x,,} is not convergent, without using Proposition 2.1.17. 

Exercise 2.1.17; Find a sequence {x„} such that for any j G M, there exists a subsequence {x„;} converging 
to y. 

Exercise 2.1.18 (Easy); Let {x,,} be a sequence and x G M. Suppose that for any e > 0, there is an M such 
that for all n>M, |x„ — x| <£. Show that lim x„ = x. 
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2.2 Facts about limits of sequences 

Note: 2-2.5 lectures, recursively defined sequences can safely be skipped 

In this section we go over some basic results about the hmits of sequences. We start with looking 
at how sequences interact with inequalities. 

2.2.1 Limits and inequalities 

A basic lemma about limits and inequalities is the so-called squeeze lemma. It allows us to show 
convergence of sequences in difficult cases if we find two other simpler convergent sequences that 
"squeeze" the original sequence. 

Lemma 2.2.1 (Squeeze lemma). Let {bn}, and {xn} be sequences such that 

< < b„ for all n eN. 
Suppose that {a„} and {bn} converge and 

lim an = lim bn- 



Then {jc„} converges and 



lim Xn = lim a„ = lim bn- 



The intuitive idea of the proof is illustrated in Figure 2.1. If .x; is the limit of a„ and bn, then if 
they are both within e/3 of x, then the distance between a„ and bn is at most 2e/3. As jc„ is between 
an and bn it is at most 2e/3 from a„. Since a„ is at most e/3 away from x, then jc„ must be at most e 
away from x. Let us follow through on this intuition rigorously. 



an ^ X Xn ^ /■^ bn 

Figure 2.1: Squeeze lemma proof in picture. 



Proof. Let x := lim a„ = lim bn. Let £ > be given. 

Find an Mi such that for all n > Mi we have that |a„ — x| < £/3, and an M2 such that for all 
n > M2 we have \bn-x\ < e/3. Set M := max{Mi,M2}. Suppose that n>M.W& compute 

\Xn an\ — Xn an ^ bn an 

= \bn ~ X + X — an\ 
< \bn — x\ + \x — an\ 

£ £ 2e 
<3 + 3 = T 
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Armed with this information we estimate 

I "^n, ^ /7 1 I I ^ n, I 

2e £ 

And we are done. □ 

Example 2.2.2: One application of the squeeze lemma is to compute limits of sequences using 
limits that are already known. For example, suppose that we have the sequence {^}. Since 

y/n > 1 for all n G N we have 

o<J^<l 

nWn n 



for all n G N. We already know that lim i/« = 0. Hence, using the constant sequence {0} and the 
sequence {V"} i^i the squeeze lemma, we conclude that 

lim = 0. 

Limits also preserve inequalities. 
Lemma 2.2.3. Let and be convergent sequences and 

for all n G N. Then 

lim Xn < lim j„. 

Proof. Let x := lim x„ and y := lim j„. Let £ > be given. Find an M\ such that for all n > Mi we 
have \xn — .^1 < e/2. Find an M2 such that for all n > M2 we have \yn —y\< ^2. In particular, for 
some n > max{Mi,M2} we ha\e x — x„ < e/2 and j„ — j < £/2. We add these inequalities to obtain 

y,i-Xn+x-y < e, or yn-Xn <y~x + e. 

Since Xn < yn we have < j„ — jc„ and hence 0<j — x + £. In other words 

x-y <£. 

Because £ > was arbitrary we obtain x — y < 0, as we have seen that a nonnegative number less 
than any positive £ is zero. Therefore x<y. □ 

We give an easy corollary that can be proved using constant sequences and an application of 
Lemma 2.2.3. The proof is left as an exercise. 
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Corollary 2.2.4. 

(i) Let {xn} be a convergent sequence such that Xn > 0, then 

lim x„>0. 



( ii) Let a, G M and let {jc„} be a convergent sequence such that 
for all n G N. Then 



a<Xn<b, 



a < lim Xn < b. 



In Lemma 2.2.3 and Corollary 2.2.4 we cannot simply replace all the non-strict inequalities 
with strict inequalities. For example, let Xn '■= ^V" '■= V"- Then Xn < yn, Xn < 0, and 

y„ > for all n. However, these inequalities are not preserved by the limit operation as we have 
Um Xn = lim y„ = 0. The moral of this example is that strict inequalities may become non-strict 
inequalities when limits are applied; if we know that Xn < jn for all n, we can only conclude that 

lim Xn < lim y„. 

This issue is a common source of errors. 

2.2.2 Continuity of algebraic operations 

Limits interact nicely with algebraic operations. 

Proposition 2.2.5. Let and {jn} be convergent sequences. 

(i) The sequence {zn}, where Zn '■= +yn, converges and 

lim {xn +yn) = lim Zn = lim Xn + lim 

(ii) The sequence {zn}, where Zn '■= —Jn, converges and 

lim [xn -yn) = lim Zn = lim Xn - Hm Jn- 

(Hi) The sequence {zn}, where Zn '■= ^nyn, converges and 

lim {xnyn) = lim Zn = {ymiXn]{ hm Jn 

Xn 

(iv) IfYimyn 7^ andjn 7^ 0/or all n G N, then the sequence {zn}, where Zn'-= — , converges and 

yn 

Xn limx„ 

hm — = hm z„ = . 

yn hm yn 
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Proof. Let us start with (i). Suppose that {xn} and {y„} are convergent sequences and write 
Zn := Xn+jn- Let X := lim jc„, y : = limy„, and z'-= x + y. 

Let e > be given. Find an Mi such that for all n > Mi we have \xn —x\< e/2. Find an M2 such 
that for all n > M2 we have |y„ -y\ < ^/2. Take M := max{Mi ,M2}. For all n > M we have 

\Zn-z\ = \{Xn+yn)-{x + y)\ = \Xn-X + y„-y\ 

< \xn-x\ + \yn-y\ 

£ £ 

Therefore (i) is proved. Proof of (ii) is almost identical and is left as an exercise. 

Let us tackle (iii). Suppose again that {x,,} and {);„} are convergent sequences and write 
Zn '■= Xnyn- Let X := lim x„, y := limy„, and z '■= xy. 

Let e > be given. As {xn} is convergent, it is bounded. Therefore, find a 5 > such that 
\xn\ < 5 for all n G N. Find an Mi such that for all n > Mi we have |jc„ — x| < 2{\y\+\) ■ ^^'^^ ^2 
such that for all n > M2 we have ly^ — y| < Take M := max{Mi,M2}. For all n > M we have 

\Zn-z\ = \ {Xnyn) - {xy)\ 

\Xnyn {_X~\~Xn '''•n)}'! 
= \Xn{yn-y) + {Xn-x)y\ 
< \Xniyn-y)\ + \iXn-x)y\ 
= \Xn\ \yn-y\ + \Xn-x\ \y\ 

<B\y„-y\ + \xn-x\ \y\ 
£ e 

Finally let us tackle (iv). Instead of proving (iv) directly, we prove the following simpler claim: 
Claim: If {y^ } is a convergent sequence such that lim j„ 7^ and y^ ^ for all n E f^, then 

lim — = — ^ — . 
«^-3;„ hmy„ 

Once the claim is proved, we take the sequence {!/>'«}' multiply it by the sequence {jc„} and 
apply item (iii). 

Proof of claim: Let £ > be given. Let y := lim Find an M such that for all n > M we have 

,2e \y\ 



\yn-y\ <min||3;|^ ^, ^ 
Note that 

\y\ = \y-yn+yn\ < \y-yn\ + \yn\ 
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or in other words \y„\ > \y \ — \y — yn\ - Now jy^ — j| < ^ implies that 



\y\-\yn-y\ > 



\y\ 



Therefore 



and consequently 



\yn\ > \y\ - \y-yn\ > 



\y\ 



1 2 

\yn\ \y\' 



Now we can finish the proof of the claim. 

1 1 

yn y 



< 



< 



y-yn 




yyn 




\y-yn 




\y\ \yn\ 




\y-yn 


2 


\y\ 


\y\ 




2 



\y\ \y\ 



£. 



And we are done. 



□ 



By plugging in constant sequences, we get several easy corollaries. If c G M and {xn} is a 
convergent sequence, then for example 



lim cXfi = c ( lim x„ 



and lim (c + x„) = c + lim x„. 

Similarly with constant subtraction and division. 

As we can take limits past multiplication we can show (exercise) that limx^ = (limx„)* 
G N. That is, we can take limits past powers. Let us see if we can do the same with roots. 

Proposition 2.2.6. Let {xn} be a convergent sequence such that x„ > 0. Then 

hm ^/x^ = ^ / lim X, 



Of course to even make this statement, we need to apply Corollary 2.2.4 to show that lim Xn > 0, 
so that we can take the square root without worry. 
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Proof. Let {jc„} be a convergent sequence and let jc := lim x„. 

First suppose that x = 0. Let £ > be given. Then there is an M such that for all n > M we have 
Xn —\xn\< or in other words ^/x^ < e. Hence 

I ~ = -^/x^ < £■ 

Now suppose that x> (and hence ^/x>0). 



Xn X 



/^+ s/x 

1 

, — I 1= kn 



Xn X\ 



< 



1 



\Xn X\ 



X 



We leave the rest of the proof to the reader. 



□ 



A similar proof works for the kth root. That is, we also obtain limxy*^ = (limx„)^/^. We leave 
this to the reader as a challenging exercise. 

We may also want to take the limit past the absolute value sign. 

Proposition 2.2.7. If{xn} is a convergent sequence, then {\x„\} is convergent and 



lim \xn\ = 



limx„ 

n— >^oo 



Proof. We simply note the reverse triangle inequality 

I I Xfi I I I Xj^ X I • 

Hence if — can be made arbitrarily small, so can I — 1. Details are left to the reader. □ 



2.2.3 Recursively defined sequences 

Now that we know we can interchange limits and algebraic operations, we can compute the limits 
of many sequences. One such class are recursively defined sequences, that is, sequences where the 
next number in the sequence computed using a formula from a fixed number of preceding elements 
in the sequence. 



Example 2.2.8: Let {jc„} be defined by jci :—2 and 



2Xn 
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We must first find out if this sequence is well defined; we must show we never divide by zero. Then 
we must find out if the sequence converges. Only then can we attempt to find the limit. 

First let us prove that Xn > for all n (and the sequence is well defined). Let us show this by 
induction. We know that xi = 2> 0. For the induction step, suppose that Xn > 0. Then 



x,j 2 2Xj^ Xfj-\-2 Xfj-\-2 



2Xri 2Xii 2Xfi 

If Xn > 0, then + 2 > and hence Xn+\ > 0. 

Next let us show that the sequence is monotone decreasing. If we show that — 2 > for all n, 
then x„+i < Xn for all n. Obviously Xj— 2 = 4 — 2 = 2>0. For an arbitrary n we have that 

xl + 2\^ 4 + 4xl + 4-Sxl _xt-4xl + 4 _ {xl-2)^ 



xl+,-2=\-^] -2 



2x„ I 4x1 4x1 

Since jc„ > and any number squared is nonnegative, we have that x^^^ — 2 > for all n. Therefore, 
{x„} is monotone decreasing and bounded, and the limit exists. It remains to find the limit. 
Let us write 

2XnXn^\ = Xj^ -\- 2. 

Since {x„+i} is the 1-tail of it converges to the same limit. Let us define x := limx„. We take 
the limit of both sides to obtain 

2x^=x^ + 2, 
or ^2 = 2. As X > 0, we know that x = \/2. 

You should, however, be careful. Before taking any limits, you must make sure the sequence 
converges. Let us see an example. 

Example 2.2.9: Suppose x\ := 1 and := x^ +x„. If we blindly assumed that the limit exists 
(call it x), then we would get the equation x = x'^ +x, from which we might conclude that x = 0. 
However, it is not hard to show that {xn} is unbounded and therefore does not converge. 

The thing to notice in this example is that the method still works, but it depends on the initial 
value xi. If we set xi := 0, then the sequence converges and the limit really is 0. An entire branch 
of mathematics, called dynamics, deals precisely with these issues. 



2.2.4 Some convergence tests 

It is not always necessary to go back to the definition of convergence to prove that a sequence is 
convergent. We first give a simple convergence test. The main idea is that {xn} converges to x if 
and only if {\x„ —x\} converges to zero. 
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Proposition 2.2.10. Let {xn} be a sequence. Suppose that there is an x and a convergent 
sequence such that 

lim an = 

and 

for all n. Then {xn} converges and lim Xn = x. 

Proof. Let e > be given. Note that a„ > for all n. Find an M G N such that for all n > M we 
have a„ = |a„ — 0| < £. Then, for all n > M we have 

□ 

As the proposition shows, to study when a sequence has a limit is the same as studying when 
another sequence goes to zero. For some special sequences we can test the convergence easily. First 
let us compute the limit of a very specific sequence. 

Proposition 2.2.11. Let c > 0. 

(i) Ifc<\, then 



lim c" = 0. 



(ii) Ifc>\, then {c"} is unbounded. 



Proof. First let us suppose that c > 1 . We write c = 1 + r for some r > 0. By induction (or using 
the binomial theorem if you know it) we see that 

c" = (l+r)"> 1+Rr. 

By the Archimedean property of the real numbers, the sequence { 1 + nr} is unbounded (for any 
number B, we can find an n such that nr >B — I). Therefore c" is unbounded. 
Now let c < 1 . Write c = where r > 0. Then 

1 111 

c" = < < --. 

(1+r)" l+nr rn 

As {i} converges to zero, so does {7^}- Hence, {c"} converges to zero. □ 

If we look at the above proposition, we note that the ratio of the {n + \)th term and the nth term 
is c. We can generalize this simple result to a larger class of sequences. The following lemma will 
come up again once we get to series. 
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Lemma 2.2.12 (Ratio test for sequences). Let {xn} be a sequence such that Xn 7^ Ofor all n and 
such that the limit 



L := lim 



exists. 



(i) IfL< 1, then {xn} converges and \imx„ = 0. 

(ii) IfL > 1, then {x„} is unbounded (hence diverges). 

If L exists, but L = 1, the lemma says nothing. We cannot make any conclusion based on that 
information alone. For example, consider the sequences 1,1,1,1,... and 1,-1,1,-1,1, 



Proof. Suppose L < 1 . As 



> 0, we have that L > 0. Pick r such that L<r<l.Asr-L>0, 



there exists an M G N such that for all n > M we have 



Therefore, 



Fn+l I 



<r-L. 



< r. 



For n> M (that is for n > M + 1) we write 

|'"'^M+l| |'"'^M+2| \^n\ 



\xm\ \xm+i\ 



< \xM\rr---r= \xM\r' 



n—M 



The sequence {r"} converges to zero and hence \xM\r^^r" converges to zero. By Proposition 2.2.10, 
the M-tail of {xn} converges to zero and therefore {jc„} converges to zero. 

Now suppose L > 1. Pick r such that 1 < r < L. As L — r > 0, there exists an M G N such that 
for all n > M we have 



\x 








Xfi 





Therefore, 

Again for « > M we write 

l-^M+ll |-^M+2| 



1 



<L-r. 



> r. 



I 



l-^-ji— 1 1 



> \xm\ rr---r= \xm\ r' 



The sequence {r"} is unbounded (since r > 1), and therefore cannot be bounded (if \xn\ < B 
for all n, then r" < j^^r^ for all n, which is impossible). Consequently, {xn} cannot converge. □ 
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Example 2.2.13: A simple application of the above lemma is to prove that 

lim — = 0. 

Proof: We find that 

2«+i/(„ + l)! 2"+i n\ 2 



2"/nl 2" {n+l)l n+l' 

It is not hard to see that {^^} converges to zero. The conclusion foUows by the lemma. 

2.2.5 Exercises 

Exercise 2.2.1: Prove Corollary 2.2.4. Hint: Use constant sequences and Lemma 2.2.3. 

Exercise 2.2.2: Prove part (ii) of Proposition 2.2.5. 

Exercise 2.2.3: Prove that if {x^} is a convergent sequence, ^ £ N, then 

lim;ic^ = ('limx„') . 



Hint: Use induction. 

Exercise 2.2.4; Suppose that xi := ^ and := x,^. Show that {x„} converges and find lim x„. Hint: You 
cannot divide by zero! 

Exercise 2.2.5: Let Xn '■= " jjse the squeeze lemma to show that {x,,} converges and find the limit. 

Exercise 2.2.6: LetXn '.= \ andy„ := Define Zn '■= — andwn '■= ^. Do |z„| and \wn} converge? What 
are the limits? Can you apply Proposition 2.2.5? Why or why not? 

Exercise 2.2.1: True or false, prove or find a counterexample. If{x„} is a sequence such that {x^} converges, 
then {x„} converges. 

Exercise 2.2.8: Show that 

lim =0. 

Exercise 2.2.9: Suppose that {x,,} is a sequence and suppose that for some x S K, the limit 

L := lim 



n^=» \Xn—x\ 

exists and L <\. Show that {x„} converges to x. 

Exercise 2.2.10 (Challenging): Let {x,,} be a convergent sequence such that x„ > and ^ £ N. Then 



lim xy*^ = ( lim x„ 



Hint: Find an expression q such that 



^y*-x'A _ 1 



i/k 



2.3. LIMIT SUPERIOR, LIMIT INFERIOR, AND BOLZANO-WEIERSTRASS 



61 



2.3 Limit superior, limit inferior, and Bolzano-Weierstrass 

Note: 1-2 lectures, alternative proof of BW optional 

In this section we study bounded sequences and their subsequences. In particular we define 
the so-called limit superior and limit inferior of a bounded sequence and talk about limits of 
subsequences. Furthermore, we prove the so-called Bolzano-Weierstrass theorem*, which is an 
indispensable tool in analysis. 

We have seen that every convergent sequence is bounded, but there exist many bounded divergent 
sequences. For example, the sequence {( — 1)"} is bounded, but we have seen it is divergent. All is 
not lost however and we can still compute certain limits with a bounded divergent sequence. 



2.3.1 Upper and lower limits 

There are ways of creating monotone sequences out of any sequence, and in this way we get the 
so-called limit superior and limit inferior. These limits always exist for bounded sequences. 

If a sequence {x„} is bounded, then the set {x^-.k E N} is bounded. Then for every n the set 
{xk : k > n} is also bounded (as it is a subset). 

Definition 2.3.1. Let {xn} be a bounded sequence. Let a„ := supjjCyt : k> n} and bn '■ = inf{xk '■ 
k> n}. We note that the sequence {a„} is bounded monotone decreasing and {bn} is bounded 
monotone increasing (more on this point below). We define 



limsup.)c„ := lim a„ 
liminf Xn '■= lim b^ 



n- 



For a bounded sequence, liminf and limsup always exist. It is possible to define liminf and 
hmsup for unbounded sequences if we allow °o and It is not hard to generalize the following 
results to include unbounded sequences, however, we restrict our attention to bounded ones. 

Let us see why {a,,} is a decreasing sequence. As a„ is the least upper bound for {xk :k>n}, it 
is also an upper bound for the subset {xk : k> (n+l)}. Therefore the least upper bound for 
{xk :k> (n+l)}, has to be less than or equal to a„, that is, a„ > a^+i . Similarly, b„ is an increasing 
sequence. It is left as an exercise to show that if x„ is bounded, then a„ and bn must be bounded. 

Proposition 2.3.2. Let {xn} be a bounded sequence. Define an and bn as in the definition above. 

(i) limsup Xn = inf{a„ : n G N} and liminf Xn = sup{/?„ : n E N}. 

(ii) liminf.)c„ < limsup x„. 



*Named after the Czech mathematician Bernhard Placidus Johann Nepomuk Bolzano (1781 - 1848), and the 
German mathematician Karl Theodor Wilhelm Weierstrass (1815 - 1897). 
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Proof. The first item in the proposition follows as the sequences {a,,} and {bn} are monotone. 

For the second item, we note that b„ < an, as the inf of a set is less than or equal to its sup. 
We know that {a„} and {b„} converge to the limsup and the liminf (respectively). We apply 
Lemma 2.2.3 to obtain 



lim bn < lim a„. 



Example 2.3.3: Let be defined by 



Xfi .- 



^ if n is odd, 
if n is even. 



□ 



Let us compute the liminf and limsup of this sequence. First the limit inferior: 

liminf jc„ = lim (inf{jc^ : k > n}) = lim = 0. 

For the limit superior we write 

limsup Xn = lim (sup{x^ : k > n}) . 

It is not hard to see that 

sup{xfe ■k>n} = \ I !^ " !^ 

I ^ if n IS even. 

We leave it to the reader to show that the limit is 1. That is, 

lim sup Xn = l- 

Do note that the sequence {xn} is not a convergent sequence. 

We can associate with limsup and liminf certain subsequences. 
Theorem 2.3.4. If{xn} is a bounded sequence, then there exists a subsequence {jc„^} such that 

lim Xnp. = lim sup x„. 

Similarly, there exists a (perhaps different) subsequence {xm^} such that 

lim x,ni, = liminf Xn- 
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Proof. Define a„ := sup{x,t '■ k > n}. Write x := limsup x^ = Hm a„. Define the subsequence as 
follows. Pick := 1 and work inductively. Suppose we have defined the subsequence until for 
some k. Now pick some m> such that 

1 

We can do this as is a supremum of the set {x„ : n> n^ + l} and hence there are elements 

of the sequence arbitrarily close (or even possibly equal) to the supremum. Set n^t+i '■— ^- The 
subsequence is defined. Next we need to prove that it converges and has the right limit. 
Note that fl(„^ j+j) > an^. (why?) and that > Therefore, for every ^ > 1 we have 





arif. 


^iik 


< 




■1+1) 




1 




< 






k' 





Let us show that {xn,} converges x. Note that the subsequence need not be monotone. Let e > 
be given. As {a„} converges to x, then the subsequence {an^} converges to x. Thus there exists an 
Ml e N such that for all A: > Mi we have 



e 
2' 



Find an M2 e N such that 

1 e 
M2-2' 

Take M := max{Mi , M2} and compute. For all A: > M we have 

\x Xfii^ I = -^tik X a.fii^ I 

1 £ 

<t + 2 

1 £ e e 

< h-<- + - = £. 

-M2 2-2 2 

We leave the statement for liminf as an exercise. □ 



2.3.2 Using limit inferior and limit superior 

The advantage of liminf and limsup is that we can always write them down for any (bounded) 
sequence. Working with liminf and limsup is a little bit like working with limits, although there are 
subtle differences. If we could somehow compute them, we could also compute the limit of the 
sequence if it exists, or show that the sequence diverges. 
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Theorem 2.3.5. Let {xn} be a bounded sequence. Then {x„} converges if and only if 

liminfx„ = limsupx„. 

Furthermore, if {xn} converges, then 

lim Xn = liminf x„ = limsup x„. 

n^oo n^oo n^oo 

Proof. Define a„ and bn as in Definition 2.3. 1 . Note that 

bn — -^n — 

If liminf x„ = limsup then we know that {a„} and {bn} have limits and that these two limits are 
the same. By the squeeze lemma (Lemma 2.2.1), {xn} converges and 

lim bn = lim Xn = lim a„. 

n^oa n^oo n^'x' 

Now suppose that {xn} converges to x. We know by Theorem 2.3.4 that there exists a subse- 
quence {xn^} that converges to limsup x„. As {x„} converges to x, we know that every subsequence 
converges to x and therefore limsup x„ = lim = x. Similarly liminf Xn= x. □ 

Limit superior and limit inferior behave nicely with subsequences. 

Proposition 2.3.6. Suppose that {xn} is a bounded sequence and {xn,^} is a subsequence. Then 

liminfx„ < liminfx„^ < lim sup x,,^, < limsup .x;„. 

fc^oo j^^^ n^oo 

Proof. The middle inequality has been proved already. We will prove the third inequality, and leave 
the first inequality as an exercise. 

That is, we want to prove that limsup < limsup Define aj := sup{;cfc : k> j} as usual. 
Also define cj := sup{.x;„j : k > j}. It is not true that cj is necessarily a subsequence of aj. However, 
as nk>k for all k, we have that {xn,^ '■k> j} C {xk '■k> j}. A supremum of a subset is less than or 
equal to the supremum of the set and therefore 



Cj < aj. 



We apply Lemma 2.2.3 to conclude that 



lim Cj < lim aj, 

which is the desired conclusion. □ 



2.3. LIMIT SUPERIOR, LIMIT INFERIOR, AND BOLZANO-WEIERSTRASS 



65 



Limit superior and limit inferior are the largest and smallest subsequential limits. If the 
subsequence in the previous proposition is convergent, then of course we have that liminf Xnf. = 
lim;c„^ = limsup jc„j^. Therefore, 

liminf x„ < \imx„f, < limsupx„. 

n^oo ^^oo ;i^oo 

Similarly we get the following useful test for convergence of a bounded sequence. We leave the 
proof as an exercise. 

Theorem 2.3.7. A bounded sequence {xn} is convergent and converges to x if and only if every 
convergent subsequence {xn,^} converges to x. 

2.3.3 Bolzano-Weierstrass theorem 

While it is not true that a bounded sequence is convergent, the Bolzano-Weierstrass theorem tells us 
that we can at least find a convergent subsequence. The version of Bolzano-Weierstrass that we 
present in this section is the Bolzano-Weierstrass for sequences. 

Theorem 2.3.8 (Bolzano-Weierstrass). Suppose that a sequence of real numbers is bounded. 
Then there exists a convergent subsequence {jc„,.}. 

Proof. We use Theorem 2.3.4. It says that there exists a subsequence whose limit is limsup x„. □ 

The reader might complain right now that Theorem 2.3.4 is strictly stronger than the Bolzano- 
Weierstrass theorem as presented above. That is true. However, Theorem 2.3.4 only applies to the 
real line, but Bolzano-Weierstrass applies in more general contexts (that is, in M") with pretty much 
the exact same statement. 

As the theorem is so important to analysis, we present an explicit proof. The following proof 
generalizes more easily to different contexts. 

Alternate proof of Bolzano-Weierstrass. As the sequence is bounded, then there exist two numbers 
a\ < bi such that ai <Xn< b\ for all n E N. 

We will define a subsequence {jC;j,} and two sequences {a,} and {bi}, such that {a,} is monotone 
increasing, {bi} is monotone decreasing, a,- < Xm < bi and such that lima, = limZ?,. That x,y 
converges follows by the squeeze lemma. 

We define the sequences inductively. We will always have that a, < bi, and that Xn G [a,, bi] for 
infinitely many n gN. We have already defined ai and bi. We take ni := 1, that is Xn^ = xi. 

Now suppose that up to some ^ G N we have defined the subsequence jc„p.)c„2, . . . and 
the sequences a\,a2,. ■ ■ ^a^ and Z?i,Z?2, • • • ^b^. Let y := ^^^y^. Clearly a^ <y < b^. If there exist 
infinitely many j G N such that xj G [ayt, j], then set a^^i := a^, &yt+i •= and pick > nj^ such 
that Xnf.^1 G [ayt, y] . If there are not infinitely many j such that xj G [ak,y] , then it must be true that 
there are infinitely many j G N such that xj G [y,bk]. In this case pick ak+\ := y, bk+i := bk, and 
pick n^+i > nj, such thatXn,^^^ G [y,bk]. 
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Now we have the sequences defined. What is left to prove is that lim a, = lim bi. Obviously the 
limits exist as the sequences are monotone. From the construction, it is obvious that bi — a,; is cut in 
half in each step. Therefore bi+i — a^+i = By induction, we obtain that 

bi-ai 

Let X := lim a,. As {a,} is monotone we have that 

X = sup{a,- : ?■ G N} 

Now let J : = lim bi = inf {bi : z G N}. Obviously y < x as ai < bi for all i. As the sequences are 
monotone, then for any i we have (why?) 

bi-ai 

y-x<bi-ai = —-j-. 

As is arbitrarily small and y — x > 0, we have that y — x = 0. We finish by the squeeze 

lemma. Q 

Yet another proof of the Bolzano- Weierstrass theorem proves the following claim, which is left 
as a challenging exercise. Claim: Every sequence has a monotone subsequence. 



23 A Infinite limits 

If we allow lim inf and lim sup to take on the values oo and — oo, we can apply lim inf and lim sup to 
all sequences, not just bounded ones. For any sequence, we write 

limsup Xn := inf{a„ : n G N}, and liminf Xn := sup{Z?„ : n G N}, 

where a„ := supj^jc^t •.k>n] and bn '■ = inf{xk : k> n} as before. 

We also often define infinite limits for certain divergent sequences. 

Definition 2.3.9. We say that diverges to infinity* if for every M G M, there exists an A^^ G N 
such that for all n > A'^ we have jc„ > M. In this case we write limx„ := oo. Similarly if for every 
M G M there exists an A^^ G N such that for all n > A^^ we have Xn < M, we say diverges to minus 
infinity and we write \imxn := — oo. 

This definition behaves as expected with limsup and liminf, see exercises 2.3.13 and 2.3.14. 

Example 2.3.10: If Xn '■= for odd n and jc„ := n for even n then 

lim n = oo^ limxn does not exist, limsup Xfi — oo. 



* Sometimes it is said that {x,,} converges to infinity. 
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2.3.5 Exercises 

Exercise 2.3.1; Suppose that {x,,} is a bounded sequence. Define an and b„ as in Definition 2.3.1. Show that 
and {bn} are bounded. 

Exercise 2.3.2; Suppose that {x,,} is a bounded sequence. Define bn as in Definition 2.3.1. Show that {bn} 
is an increasing sequence. 

Exercise 2.3.3; Finish the proof of Proposition 2.3.6. That is, suppose that is a bounded sequence and 
{x„j} is a subsequence. Prove liminf x„ < liminf x„j. 

Exercise 2.3 A: Prove Theorem 2.3.7. 

(-1)" 

Exercise 2.3.5; a) Let Xn '■= ,^?i<ilimsupx„ and hminfxn. 

n 

(n 1 

b) Let Xn := , find limsup Xn and liminf x^. 

n 

Exercise 2.3.6; Let {x„} and {jn} be bounded sequences such that x„ < jnfor all n. Then show that 

limsup Xn < limsup y„ 

and 

liminf x„ < liminf 

A7— 7-00 n— >OQ 

Exercise 2.3.7; Let {x„} and {y„} be bounded sequences. 

a) Show that {jc„ is bounded. 

b) Show that 

(Um inf x„ ) + (lim inf y „ ) < lim inf (x„ + y„ ) . 

Hint: Find a subsequence {x„. } o/{x„ +yn} that converges. Then find a subsequence {x,j„, } of 
{x„;} that converges. Then apply what you know about limits. 

c) Find an explicit {x,,} and {y,,} such that 

(lim inf x„ ) + (lim inf yn) < lim inf (x„ +yn)- 

/I— >00 7-00 n— >C>0 

Hint: Look for examples that do not have a limit. 

Exercise 2.3.8; Let {x„} and {yn} be bounded sequences (from the previous exercise we know that {x„ +^,1} 
is bounded). 

a) Show that 

(limsup Xn) + (limsup j„) > limsup (x„ +3'^)- 

Hint: See previous exercise. 
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b) Find an explicit {x,,} and {3^,,} such that 

(limsup Xn) + (limsup yn) > limsup (x„ +yn)- 

Hint: See previous exercise. 

Exercise 2.3.9; IfS <zM.is a set, then x G M a cluster point if for every £ > 0, the set {x — e,x + e)r\S \ {x} 
is not empty. That is, if there are points ofS arbitrarily close to x. For example, S := {^/n : n ^ N} has a 
unique (only one) cluster point 0, but ^ S. Prove the following version of the Bolzano-Weierstrass theorem: 

Theorem. Let 5 C M be a bounded infinite set, tlien tliere exists at least one cluster point of S. 

Hint: If S is infinite, then S contains a countably infinite subset. That is, there is a sequence {x„} of 
distinct numbers in S. 

Exercise 2.3.10 (Challenging); a) Prove that any sequence contains a monotone subsequence. Hint: Call 
n£N a peak if am < a„for all m>n. Now there are two possibilities: either the sequence has at most finitely 
many peaks, or it has infinitely many peaks, 
b) Now conclude the Bolzano-Weierstrass theorem. 

Exercise 2.3.11; Let us prove a stronger version of Theorem 2.3. 7. Suppose that {x,;} is a sequences such 
that every subsequence {x„. } has a subsequence {x„„. } that converges tox. a) First show that {x„ } is bounded, 
b) Now show that {x„} converges to x. 

Exercise 2.3.12; Let {x„} be a bounded sequence. 

a) Prove that there exists an s such that for any r > s there exists a« M G N such that for alln>M we have 
x„ < r. 

b) If s is a number as in a), then prove limsup x„ < s. 

c ) Show that if S is the set of all s as in a ), then lim sup x„ = inf S. 

Exercise 2.3.13 (Easy); Suppose {x,,} is such that liminf x„ = limsup x„ =00. a) Show that {x„} is not 
convergent, and also that neither lim x„ = 00 nor lim x„ = —00 is true, b) Find an example of such a sequence. 

Exercise 2.3.14; Given a sequence {x„}. a) Show that lim x„ = 00 if and only /f liminf x„ = 00. b) Then show 
that lim x„ = —00 if and only if Mm sup x„ = —00. c) If {x„} is monotone increasing, show that either lim x„ 
exists and is finite or lim x„ = °°. 
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2.4 Cauchy sequences 

Note: 0.5-1 lecture 

Often we wish to describe a certain number by a sequence that converges to it. In this case, it is 
impossible to use the number itself in the proof that the sequence converges. It would be nice if we 
could check for convergence without being able to find the limit. 

Definition 2.4.1. A sequence {xn} is a Cauchy sequence* if for every £ > there exists an M G N 
such that for all n > M and all ^ > M we have 



Intuitively what it means is that the terms of the sequence are eventually arbitrarily close to each 
other. We would expect such a sequence to be convergent. It turns out that is true because M has the 
least-upper-bound property. First, let us look at some examples. 

Example 2.4.2: The sequence { V«} is a Cauchy sequence. 

Proof: Given e > 0, find M such that M > 2/e. Then for n,k>M we have that i/n < e/2 and 
< e/2. Therefore for n,k > M wc have 



1 


1 




1 




1 






< 




+ 




n 


k 




n 


k 



e e 
2 2 



Example 2.4.3: The sequence {^^} is a Cauchy sequence. 

Proof: Given e > 0, find M such that M > 2/e. Then for n,k>M we have that i/« < e/2 and 
i/yt < e/2. Therefore for n,k > M we have 



n+l k+1 



k{n+\)-n{k+r 



nk 

kn + k — nk — n 



nk 



nk 



k 




—n 




+ 




nk 


nk 



< 



1 1 e e 
n k 2 2 



Proposition 2.4.4. A Cauchy sequence is bounded. 



*Named after the French mathematician Augustin-Louis Cauchy (1789-1857). 
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Proof. Suppose that {xn} is Cauchy. Pick M such that for all n,k > M we have |jc„ — A:yt| < 1 . In 
particular, we have that for all n > M 

\Xfi xm 1^1. 

Or by the reverse triangle inequality, |x„| — \xm\ < \xn—^M\ < 1- Hence for n> M we have 

\Xn\ < 1 + \xm\ ■ 

Let 

B := max{|jci| , \x2\ , • • • , \xm-i\, 1 + \xm\}- 
Then \x„\ < B for all n G N. □ 

Theorem 2.4.5. A sequence of real numbers is Cauchy if and only if it converges. 

Proof. Let £ > be given and suppose that converges to x. Then there exists an M such that 
for n > M we have 

I ^ 

\ n I 2 

Hence for n>M and k> M we have 

I l-l \ < \ II |^^_ 

Alright, that direction was easy. Now suppose that {jc„} is Cauchy. We have shown that {x„} is 
bounded. If we show that 

liminf Xn = limsup x„, 

then {x„} must be convergent by Theorem 2.3.5. Assuming that liminf and limsup exist is where 
we use the least-upper-bound property. 

Define a := limsup x„ and b := liminf x„. By Theorem 2.3.7, there exist subsequences {xm} 
and {xm-}, such that 

limjc„^. = a and limxm, = b. 

Given an e > 0, there exists an Mi such that for all i > Mi we have \x„. —a\ < e/3 and an M2 such 
that for all i > M2 we have —b\< e/3. There also exists an M3 such that for all n,k> M3 we 
have \xn-Xk\ < ^/s. Let M := max{Mi,M2,M3}. Note that if i > M, then n,- > M and m, > M. 
Hence 

I ^ b\ = I ^2 .'^n,- ~l- Xfij Xjfii -\- Xjiy b I 

^1(2 Xn - 1 -|- \Xfi. Xmj \ ~l" |'''^m/ ^| 
£ £ £ 

As |a — Z?| < £ for all £ > 0, then a = b and the sequence converges. □ 
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Remark 2.4.6. The statement of this proposition is sometimes used to define the completeness 
property of the real numbers. That is, we can say that M is Cauchy-complete (or sometimes 
just complete). We have proved above that as M has the least-upper-bound property, then M is 
Cauchy-complete. We can "complete" Q by "throwing in" just enough points to make all Cauchy 
sequences converge (we omit the details). The resulting field has the least-upper-bound property. 
The advantage of using Cauchy sequences to define completeness is that this idea generalizes to 
more abstract settings. 

2.4.1 Exercises 

Exercise 2.4.1: Prove that {^^^^} is Cauchy using directly the definition of Cauchy sequences. 
Exercise 2.4.2: Let be a sequence such that there exists a < C < 1 such that 

\xyi+\ Xfi\ ^ C \xf, Xji— \ I . 
Prove that {x„} is Cauchy. Hint: You can freely use the formula (for C ^ I) 



l+C + C^- + ---+C" 



1-C 



Exercise 2.4.3 (Challenging); Suppose that F is an ordered field that contains the rational numbers Q, such 
that Q is dense, that is: whenever x,y £ F are such that x <y, then there exists a q gQ such that x < q <y. 
Say a sequence of rational numbers is Cauchy if given any £ G Q with e > 0, there exists an M such 

that for all n,k > M we have \x„ —Xk\ < £. Suppose that any Cauchy sequence of rational numbers has a 
limit in F. Prove that F has the least-upper-bound property. 

Exercise 2.4.4; Let {x„} and {y,,} be sequences such that Yimyn = 0. Suppose that for all k gN and for all 
m>kwe have 

\Xm-Xk\ <yk- 

Show that {x,i} is Cauchy. 

Exercise 2.4.5; Suppose that a Cauchy sequence {x„} is such that for every M € N, there exists a k>M and 
ann>M such that x^ <0 and x„ > 0. Using simply the definition of a Cauchy sequence and of a convergent 
sequence, show that the sequence converges to 0. 
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2.5 Series 

Note: 2 lectures 

A fundamental object in mathematics is that of a series. In fact, when foundations of analysis 
were being developed, the motivation was to understand series. Understanding series is very 
important in applications of analysis. For example, solving differential equations often includes 
series, and differential equations are the basis for understanding almost all of modern science. 



2.5.1 Definition 

Definition 2.5.1. Given a sequence {x„}, we write the formal object 

oo 

^ Xn or sometimes just 

n=l 

and call it a series. A series converges, if the sequence {sn} defined by 

n 

Sn ■= Y,^k=Xl+X2-\ hX„, 

k=l 

converges. The numbers s„ are called partial sums. If x := lim s„, we write 

oo 

y^. = X. 

n=l 

In this case, we cheat a little and treat number. 

On the other hand, if the sequence {sn} diverges, we say that the series is divergent. In this case, 
Y,Xn is simply a formal object and not a number. 

In other words, for a convergent series we have 

oo n 

V x„ = lim y" Xk- 

n=\ k=l 

We should be careful to only use this equality if the limit on the right actually exists. That is, the 
right-hand side does not make sense (the limit does not exist) if the series does not converge. 

Remark 2.5.2. Before going further, let us remark that it is sometimes convenient to start the series 
at an index different from 1. That is, for example we can write 

oo oo 

n=0 n=l 

The left-hand side is more convenient to write. The idea is the same as the notation for the tail of a 
sequence. 
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Remark 2.5.3. It is common to write the series Y^^n as 

Xi +X2+X3-\ 

with the understanding that the ellipsis indicates a series and not a simple sum. We do not use this 
notation as it often leads to mistakes in proofs. 



Example 2.5.4: The series 



converges and the limit is 1. That is, 



n=l 



2" 



n=l k=\ 



Proof: First we prove the following equality 



2k 2" ^' 



The equality is easy to see when n = \ . The proof for general n follows by induction, which we 
leave to the reader. Let Sn be the partial sum. We write 



|1 - 



n 1 



1 

2" 



1 

2"' 



The sequence {^} and therefore {|1 — converges to zero. So, {s„} converges to 1. 
For —1 < r < 1, the geometric series 

OO 

converges. In fact, Y.n=o^" ~ T^- "^^^ proof is left as an exercise to the reader. The proof consists 
of showing that 

n— 1 1 n 

/ - 



E 



l-r 



and then taking the limit. 

A fact we often use is the following analogue of looking at the tail of a sequence. 

Proposition 2.5.5. Let £jc„ be a series. Let M G N. Then 



^ Xn converges if and only if ^ Xn converges. 

n=l n=M 



74 



CHAPTER 2. SEQUENCES AND SERIES 



Proof. We look at partial sums of the two series (for k > M) 



k /M-l \ k 

n=l \ / n=M 



Note that YJ^^^ Xn is a fixed number. Now use Proposition 2.2.5 to finish the proof. 



□ 



2.5.2 Cauchy series 

Definition 2.5.6. A series Y^Xn is said to be Cauchy or a Cauchy series, if the sequence of partial 
sums {sn} is a Cauchy sequence. 

A sequence of real numbers converges if and only if it is Cauchy. Therefore a series is convergent 
if and only if it is Cauchy. 

The series is Cauchy if for every £ > 0, there exists an M G N, such that for every n> M 
and k>M we have 

Without loss of generality we assume that n < k. Then we write 



j=n+l 



< £. 



We have proved the following simple proposition. 



Proposition 2.5.7. The series YL^n is Cauchy if for every e > 0, there exists anM such that for 
every n>M and every k> nwe have 



k 



< £. 



2.5.3 Basic properties 

Proposition 2.5.8. Let Y^nbe a convergent series. Then the sequence {xn} is convergent and 

lim Xn = 0. 

Proof. Let e > be given. As is convergent, it is Cauchy. Thus we find an M such that for 
every n>M we have 

n+l 

£ > = \Xn+l I ■ 

j=n+l 

Hence for every n > M + 1 we have that \xn \ < £. □ 
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So if a series converges, the terms of the series go to zero. The implication, however, goes only 
one way. Let us give an example. 

Example 2.5.9: The series diverges (despite the fact that lim^ = 0). This is the famous 

harmonic series\ 

Proof: We will show that the sequence of partial sums is unbounded, and hence cannot converge. 
Write the partial sums for n = 2^ as: 

^1 = 1, 

1 



^2 = (1) + 



V 1 



J=l \m=2i-l + l ^, 

We note that 1/3 + 1/4 > + = V2 and 1/5 + i/e + + > + + Vs + i/s = 1/2. More 
generally 

2* 1 \ 1 1 

m=2*-' + l ra=2*-i + l 

Therefore 

^.-' + e( E _;!;)si + Ej-' + ^. 

J=l \ni=2*-l + l / ,7=1 

As {j} is unbounded by the Archimedean property, that means that {s2k} is unbounded, and 
therefore {sn} is unbounded. Hence {sn} diverges, and consequently diverges. 

Convergent series are linear. That is, we can multiply them by constants and add them and these 
operations are done term by term. 

Proposition 2.5.10 (Linearity of series). Let a G M and Y^Xn and ^ j„ be convergent series. Then 
( i) OJ-^n ^ convergent series and 

00 00 

n=l n=l 



*The divergence of the harmonic series was known before the theory of series was made rigorous. In fact the proof 
we give is the earhest proof and was given by Nicole Oresme (13237-1382). 
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hi 











( ii) L(-^n + yn) is a convergent series and 



n=l 

Proof. For the first item, we simply write the nth partial sum 

n / n ^ 

k=i \k=i 

We look at the right-hand side and note that the constant multiple of a convergent sequence is 
convergent. Hence, we simply take the limit of both sides to obtain the result. 
For the second item we also look at the nth partial sum 

Y,(^k+yk) = ( Y^xk] + I Y.yk] ■ 

k=i \k=i J \k=i J 

We look at the right-hand side and note that the sum of convergent sequences is convergent. Hence, 
we simply take the Umit of both sides to obtain the proposition. □ 

Note that multiplying series is not as simple as adding, see the next section. It is not true, of 
course, that we can multiply term by term, since that strategy does not work even for finite sums. 
For example, (a + b){c + d) ^ ac + bd. 



2.5.4 Absolute convergence 

Since monotone sequences are easier to work with than arbitrary sequences, it is generally easier 
to work with series £x„ where x„ > for all n. Then the sequence of partial sums is mono- 
tone increasing and converges if it is bounded from above. Let us formalize this statement as a 
proposition. 

Proposition 2.5.11. Ifxn > Ofor all n, then Y^^n converges if and only if the sequence of partial 
sums is bounded from above. 

The following criterion often gives a convenient way to test for convergence of a series. 

Definition 2.5.12. A series £x„ converges absolutely if the series converges. If a series 

converges, but does not converge absolutely, we say it is conditionally convergent. 

Proposition 2.5.13. If the series Y^^n converges absolutely, then it converges. 
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Proof. We know that a series is convergent if and only if it is Cauchy. Hence suppose that ^ is 
Cauchy. That is for every e > 0, there exists an M such that for all A: > M and n> kwe have that 



j=k+i 



j=k+i 



< e. 



We apply the triangle inequality for a finite sum to obtain 



j=k+i 



< £ \xj\<e. 

j=k+l 



Hence is Cauchy and therefore it converges. 



□ 



Of course, if J^x„ converges absolutely, the limits of Y,Xn and \x„\ are different. Computing 
one does not help us compute the other. 

Absolutely convergent series have many wonderful properties for which we do not have space 
in this book. For example, absolutely convergent series can be rearranged arbitrarily. 

We leave as an exercise to show that 

y (-ir 

converges. On the other hand we have already seen that 



n=l 



(-1)" 

diverges. Therefore ^ conditionally convergent subsequence. 



2.5.5 Comparison test and the /7-series 

We have noted above that for a series to converge the terms not only have to go to zero, but they 
have to go to zero "fast enough." If we know about convergence of a certain series we can use the 
following comparison test to see if the terms of another series go to zero "fast enough." 

Proposition 2.5.14 (Comparison test). Let £jc„ and Y^Jn be series such that < jc„ < _y„ for all 

nen. 

(i) IfY^yn converges, then so does Y,Xn- 

(ii) IfY^^n diverges, then so does Y^yn- 
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Proof. Since the terms of the series are all nonnegative, the sequences of partial sums are both 
monotone increasing. Since x„ < y„ for all n, the partial sums satisfy 

t,xk<f^yk. (2.1) 

k=l k=l 

If the series converges the partial sums for the series are bounded. Therefore the right-hand 
side of (2.1) is bounded for all n. Hence the partial sums for J^x^ are also bounded. Since the partial 
sums are a monotone increasing sequence they are convergent. The first item is thus proved. 

On the other hand if diverges, the sequence of partial sums must be unbounded since it is 
monotone increasing. That is, the partial sums for £x„ are bigger than any real number. Putting this 
together with (2.1) we see that for any 5 e M, there is an n such that 

Hence the partial sums for Y^yn are also unbounded, and ][^j„ also diverges. □ 

A useful series to use with the comparison test is the p-series. 
Proposition 2.5.15 (7?-series or the p-test). For p G M, the series 

OO -I 

E- 

ntl 

converges if and only if p> 1. 

Proof. First suppose p < 1. As n > 1, we have ^ >\. Since ^ diverges, we see that the ^ 
must diverge for all p < 1 by the comparison test. 

Now suppose that p> I. We proceed in a similar fashion as we did in the case of the harmonic 
series, but instead of showing that the sequence of partial sums is unbounded we show that it is 
bounded. Since the terms of the series are positive, the sequence of partial sums is monotone 
increasing and will converge if we show that it is bounded above. Let Sk denote the k\h partial sum. 

51 = 1, 

^3 = (l)+f4+ ^ 



IP 3P 
1 1 



1 



k-l /2-'+'-l 

^2'-, = 1 + 1 E „ 

j=l \ m=2J 
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Instead of estimating from below, we estimate from above. In particular, as p > 1, then 2^ < 3^, 
and hence Tp + jp<4p + 4p- Similarly 4p + Tp + 4p + 4p<Tp + Tp + Tp + 4p- Therefore 




As /> > 1, then < 1- Then by using the result of Exercise 2.5.2, we note that 

% (^) 

converges. Therefore 

As {sn} is a monotone sequence, then all Sn < ■^2*-i all n < 2^ — 1. Thus for all n, 

^»<i + E(^)' 

The sequence of partial sums is bounded and hence converges. □ 

Note that neither the p-series test nor the comparison test tell us what the sum converges to. 
They only tell us that a limit of the partial sums exists. For example, while we know that V"^ 
converges it is far harder to find* that the limit is ^^/e. If we treat ^ i/n'^ as a function of p, we get 
the so-called Riemann C, function. Understanding the behavior of this function contains one of the 
most famous unsolved problems in mathematics today and has applications in seemingly unrelated 
areas such as modem cryptography. 

Example 2.5.16: The series ;^rpY converges. 

Proof: First note that -ttt < \ for all n G N. Note that Y X converges by the p-series test. 
Therefore, by the comparison test, -^^rpi converges. 



* Demonstration of this fact is what made the Swiss mathematician Leonhard Paul Euler (1707 - 1783) famous. 
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2.5.6 Ratio test 

Proposition 2.5.17 (Ratio test). Let Y^Xnbe a series such that 

L:= lim^ 

\Xn\ 

exists. Then 

(i) IfL < 1, then Y^Xn converges absolutely. 

(ii) IfL > 1, then £x„ diverges. 

Proof. From Lemma 2.2.12 we note that if L > 1, then Xn diverges. Since it is a necessary condition 
for the convergence of series that the terms go to zero, we know that Y^Xn must diverge. 

Thus suppose that L < 1 . We will argue that Y I must converge. The proof is similar to that 
of Lemma 2.2.12 . Of course L > 0. Pick r such that L < r < 1. As r — L> 0, there exists an M G N 
such that for all n > M 

Un+l I 



Therefore, 

\Xn+l I 



<r-L. 



< r. 

\^n I 

For n> M (that is for n > M + 1) write 

I I I I l-^^M+ll l-^M+al kn| .11 II n-M i\ \ -M\ n 

\Xn\ = \xM\-, Ti \""\ : <\xM\rr---r=\xM\r =[\xM\r Jr. 

For n> M we write the partial sum as 



k=l \k=\ / \k=M+l 

M \ / n 



< I IN 1 + 1 I (Nk->^ 



yk=l / \k=M+l 

(M \ ( n 

IN +(kMk-^) I 
k=l J \k=M+l 

As < r < 1 the geometric series Lr=o'"^ converges, and of course Lr=M+i converges as well 
(why?). We take the limit as n goes to infinity on the right-hand side to obtain 

n f ^ \ / " 

IN< IN +(Nk-^) I r" 

k=\ \k=\ I \/t=M+l 



M 



<|lNl+(Nk-^) I 



2.5. SERIES 



81 



The right-hand side is a number that does not depend on n. Hence the sequence of partial sums of 
£ \xn\ is bounded and ^ \xn\ is convergent. Thus ][^jc„ is absolutely convergent. □ 

Example 2.5.18: The series 

n=l 

converges absolutely. 
Proof: We write 

2(«+i)/(n+l)! .. 2 ^ 
hm — ^ = lim = 0. 

n->oo n^oofi-\-l 

Therefore, the series converges absolutely by the ratio test. 
2.5.7 Exercises 

Exercise 2.5.1: For r^l, prove 



n-1 1 — r" 



Hint: Let s := Lfc=o then compute s{\ — r) = s — rs, and solve for s. 
Exercise 2.5.2; Prove that for — \<r<\we have 

1 



n=0 ^ 



//m?; Use the previous exercise. 

Exercise 2.5.3; Decide the convergence or divergence of the following series. 

3 u^■^ ^ (-1)" 



^^12;^ ^^EV 

00 1 00 

„=i «(« + !) 
Exercise 2.5.4; 

00 00 

a) Prove that if ^ x„ converges, then ^ (x2n +X2«+i) also converges. 

1 n=i 

b) Find an explicit example where the converse does not hold. 

Exercise 2.5.5; For j = 1,2, ... ,n, let {xj^k\'k=i denote n sequences. Suppose that for each j 



is convergent. Then show 



k=l 



j=i \k=\ J k=\ \j=i J 
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Exercise 2.5.6: Prove the following stronger version of the ratio test: Let £;c„ be a series. 

a) If there is an N and a p < I such that for all n> N we have < p, then the series converges 

absolutely. 

b) If there is an N such that for alln>N we have ^^^rp > 1, then the series diverges. 

Exercise 2.5.7 (Challenging): Let be a decreasing sequence such that converges. Show that 
lim nxn = 0. 

~ (-1)" 

Exercise 2.5.8; Show that \ converges. Hint: consider the sum of two subsequent entries. 

Exercise 2.5.9: 

a) Prove that ifY,^n cind £ j„ converge absolutely, then Y^^nyn converges absolutely. 

b) Find an explicit example where the converse does not hold. 

c) Find an explicit example where all three series are absolutely convergent, are not just finite sums, and 
(E^n)(E3'n) 7^ E^nJn- That is, show that series are not multiplied term-by-term. 

Exercise 2.5.10; Prove the triangle inequality for series, that is ifY,^n converges absolutely then 



n=l 



Exercise 2.5.11; Prove the limit comparison test That is, prove that if an > and bn > Ofor all n, and 

an 

< um — <°°, 

n-^oobn 

then either Y,cin andY^bn both converge of both diverge. 
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2.6 More on series 

Note: up to 2-3 lectures ( optional, can safely be skipped or covered partially) 

2.6.1 Root test 

We have seen the ratio test before. There is one more similar test called the root test. In fact, the 
proof of this test is similar and somewhat easier. 

Proposition 2.6.1 (Root test). Let Y,Xn be a series and let 



L := limsup |jc„ 

n— >oo 



1/m 



Then 

(i) IfL < 1 then Y,Xn converges absolutely. 

(ii) IfL > 1 then Y,Xn diverges. 

Proof. If L > 1, then there exists a subsequence such that L = limi^^^ |jc„^| ^Z"*. Let r be such 

that L> r> 1. There exists an M such that for all A: > M, we have \xn,^\^^"'' > r > 1, or in other 
words \xn^\ > r"* > 1. The subsequence and therefore also {|x„|}, cannot possibly converge 

to zero, and so the series diverges. 

Now suppose that L < 1. Pick r such that L < r < 1. By definition of Umit supremum, pick M 
such that for all n > M we have that 

sup{|xfep/^ : A:>n} < r. 

Therefore, for all n > M we have 

I^mI < or in other words |jc„| < r". 

Let n> M and let us estimate the nth partial sum 

n ( M \ ( n \/^\/" 

k=l \k=l / \k=M+l ) \k=\ ) \k=M+\ 

As < r < 1, the geometric series YX=M+\ ^ converges to As ever)1;hing is positive we have 

k=\ \k=\ / 

Thus the sequence of partial sums of |jc„| is bounded, and so the series converges. Therefore Y^Xn 
converges absolutely. □ 
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2.6.2 Alternating series test 

The tests we have so far only addressed absolute convergence. The following test gives a large 
supply of conditionally convergent series. 

Proposition 2.6.2 (Alternating series). Let {xn} be a monotone decreasing sequence of positive 
real numbers such that limx„ = 0. Then 

oo 

n=l 

converges. 

Proof. Write := L^Ll {~^)^^k be the mth partial sum. Then write 

2w n 

Sin = {-'i-fxk = {-Xi +X2) -\ h {-X2n-1 +-^2«) = {-Xlk-l +X2k)- 

k=l k=l 

The sequence {xk} is decreasing and so {—X2k-i +X2k) < for all k. Therefore the subsequence 
{s2n} of partial sums is a decreasing sequence. Similarly, {x2k—X2k-i) > 0, and so 

S2n = -Xi + {x2-X3)-\ h (;C2«-2 -^2«-l) +X2n > -Xl- 

The sequence {s2n} is decreasing and bounded below, so it converges. Let a := lim S2n- 
We wish to show that Um Sm = a (not just for the subsequence). Notice 

S2n+l — S2n +-^2n+l- 

Given e > 0, pick M such that \s2n — < ^/^ whenever 2n > M. Since lim;c„ = 0, we also make M 
possibly larger to obtain X2n+\ < whenever In > M. If In > M, we have \s2n —a\< e/2 < e, so 
we just need to check the situation for S2n+\'- 

\s2n+\-a\ = \s2n-a+X2n+l \ < \s2n - a\ + X2n+l <^/2 + £/2 = £. 

□ 



Notice that this means that there exist conditionally convergent series where the absolute values 
of the terms go to zero arbitrarily slowly. For example. 



, nP 

n=l 



converges for arbitrarily small p>0. 
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2.6.3 Rearrangements 

Generally, absolutely convergent series behave as we imagine they should. For example, absolutely 
convergent series can be summed in any order whatsoever. Nothing of the sort holds for conditionally 
convergent series (see Example 2.6.4 and Exercise 2.6.3). 
Take a series 

oo 

Given a bijective function o : N — t- N, the corresponding rearrangement is the following series: 



k=l 

We simply sum the series in a different order. 

Proposition 2.6.3. Let £x„ be an absolutely convergent series converging to a number x. Let 
o : N ^ N be a bijection. Then Y^X(j(n) '■^ absolutely convergent and converges to x. 

In other words, a rearrangement of an absolutely convergent series converges (absolutely) to the 
same number. 



Proof. Let e > be given. Then take M to be such that 

% and 
2 











— X 


< 









E < 



n=M+\ 



As a is a bijection, there exists a number K such that for each n < M, there exists k < K such that 
o{k) =n. In other words {1,2,...,M} C C7({1,2, . . . , A"}) . 

Then for any N >K, let k := maxa({l, 2, ...,K} and compute 



N 

I 

,«=1 



E ^(^{") 



( 



E-^«+ E ^o{n) 



M 

Y. 

n=i 
M 

M 



N 



\ 



n=l 
a{n)>M 



J 



< 



< 



-X 



X 



N 

+ E 

n=l 
a{n)>M 

k 

+ E 

n=M+l 



\Xyi 



So E-''^c7(n) converges to x. To see that the convergence is absolute, we apply the above argument to 
£ \xn\ to show that £ U(j(„) I converges. □ 
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(—1)"+^ 

Example 2.6.4: Let us show that the alternating harmonic series £ — — , which does not converge 
absolutely, can be rearranged to converge to anything. The odd terms and the even terms both 
diverge to infinity (prove this!): 

oo -j^ oo 

and Y = °°- 



n=l n=l 



Let a„ := — — for simplicity, let an arbitrary number L e M be given, and set a(l) := 1. Suppose 
we have defined o{n) for all n<N.lf 

N 

<^<7(n) < L, 

n=l 

then let o{N +1) := be the smallest odd k eN that we have not used yet, that is G{n) ^ k for all 
n<N. Otherwise let cr(A'^ + 1) := ^ be the smallest even k that we have not yet used. 

By construction cr : N — t- N is one-to-one. It is also onto, because if we keep adding either odd 
(resp. even) terms, eventually we will pass L and switch to the evens (resp. odds). So we switch 
infinitely many times. 

Finally, let be the where we just pass L and switch. For example suppose we have just 
switched from odd to even (so we start subtracting), and let N' >Nhe where we first switch back 
from even to odd. Then 

^ N-l N'-l J 

And similarly for switching in the other direction. Therefore, the sum up to A'^' — 1 is within 
min{cy(Al) ^' Switch infinitely many times we obtain that (7{N) — )■ o° and (7{N') — )■ oo, 

and hence 

oo oo ^ -|^'^C7(n) + l 

Here is an example to illustrate the proof. Suppose L = 1.2, then the order is 

1 + 1/3 - 1/2 + 1/5 + 1/7 + 1/9 - 1/4 +1/11 + 1/13 - 1/6 + l/l5 + l/l7 + l/l9 - l/s + • • • . 

At this point we are no more than 1/8 from the limit. 

2.6.4 Multiplication of series 

As we have already mentioned, multiplication of series is somewhat harder than addition. If we 
have that at least one of the series converges absolutely, than we can use the following theorem. For 
this result it is convenient to start the series at 0, rather than at 1. 
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Theorem 2.6.5 (Mertens' theorem*). Suppose that E^=oa„ and Y,n=o^n convergent series, 

converging to A and B respectively. If at least one of the series converges absolutely, then the series 
Er=o'^« where 

n 

Cn = aobn + aibn-1 H ^anbo = Y^ajbn-j, 

7=0 

converges to AB. 



Proof. Suppose that ^a„ converges absolutely, and let £ > be given. In this proof instead of 
picking complicated estimates just to make the final estimate come out as less than e, let us simply 
obtain an estimate that depends on e and can be made arbitrarily small. 
Write 



n=0 



n=0 



We rearrange the mth partial sum of ^c„: 



AB 



,;i=0 



mi^jbn-j]-AB 



m 



^ BnClm-n 1 — AB 
1=0 / 
m \ 

'£{Bn-B)a,n-n +BA^-AB 



m 



< I Y,\^n-B\ \am-n\ I + \B\ |A„, -A| 

.n=0 



We can surely make the second term on the right hand side go to zero. Pick K such that for all 
m>K we have \Am —A\<£ and also \Bm —B\<£ and finally, as converges absolutely, make 
sure that K is large enough such that for a\\m> K we have 



n=K 



As Y^bn converges, then we have that fi^ax := sup{ |5„ — 5| : n = 0, 1 , 2, . . .} is finite. Take m > 2K, 



*Proved by the German mathematician Franz Mertens (1840 - 1927). 
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then in particular m — K+1 >^+l. So 

n \ / m—K \ I 

^ |5„ — 5| |fl^_„| I = I ^ |5„ — 5| |a^_„| j + I ^ \Bn — B\\am-n\ 

v_n=0 / \«=0 / \n=m-K+\ 

\^n\ 5inax+ e|«n| 
n=K J \n=Q 

< e^max + e \an\ ■ 

Vn=0 / 

Therefore, for m > 2K we have 









do") 


-AB 











yn=0 

(OO 
£ \an\\ +\B\e^e ( 5inax + I £ j +\B\\. 

The expression in the parenthesis on the right hand side is a fixed number. Hence, we make the right 
hand side arbitrarily small by picking a small enough e > 0. So ^^=0 '^n converges to AB. □ 

Example 2.6.6: If both series are conditionally convergent the product series need not even con- 
verge. Suppose we take a,, =bn = ("l)":^/^^- The series YZ^Q^n = Lr=o^« converges by the 
alternating series test, however, it does not converge absolutely as can be seen from the p-test. Let 
us look at the Cauchy product. 

c„^(-l)"| — ^ + ^+ , ^ +---—^ I =(-!)"£ , ^ 

Therefore 

n 1 n 1 

\cn\ = y , > y , = 1. 

The terms do not go to zero and hence X)cn cannot converge. 
2.6.5 Power series 

Fix G I^- A power series about xq is a series of the form 

OO 

^ an{x-XQ)'\ 
n=0 
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A power series is really a function of x, and many important functions in analysis can be written as 
a power series. 

We say that a power series is convergent if there is at least one x^ xq that makes the series 
converge. Note that it is trivial to see that iix = XQ then the series always converges since all terms 
except the first are zero. If the series does not converge for any point x^ xq,wq say that the series 
is divergent. 

Example 2.6.7: The series 

OO , 

E V 

is convergent for all jc G M. This can be seen using the ratio test: For any x notice that 

(l/(n+l)!)y'+i X ^ 

lim ,. , ,, = lim = 0. 

^^oo [\/n\)x" n + 1 

In fact, you may recall from calculus that this series converges to e^. 

Example 2.6.8: The series 

OO 1 

n=\ 

converges absolutely for all jc G (—1,1) via the ratio test: 

(l/(n + l))x"+l 



lim 

n— >oo 



{l/n)x" 



lim \x\ = |jc| < 1. 

n^'=o n+l 



(—1)" 

It converges at x = — 1, as Er=i „ converges by the alternating series test. But the power series 
does not converge absolutely at .x; = —1, because Lr=i n ^^^^ ^'^^ converge. The series diverges at 
x= I. When |x| > 1, then the series diverges via the ratio test. 

The last example actually shows what happens for power series in general. 

Proposition 2.6.9. Let I^a„(x — jcq)" be a power series. If the series is convergent, then either it 
converges at all jc G M, or there exists a number p, such that the series converges absolutely on the 
interval {xq — p,xo + p) and diverges when x < xq — p or x > xq + p. 

The number p is called the radius of convergence of the power series. We write p = oo if 
the series converges for all x, and we write p = if the series is divergent. See Figure 2.2. In 
Example 2.6.8 the radius of convergence is p = 1. 



Proof. Write 



R := lim sup 



l/n 
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diverges converges absolutely diverges 




XQ-P XQ XQ+p 

Figure 2.2: Convergence of a power series. 



We use the root test to prove the proposition: 

L = limsup \an{x — xoy\^^" = |;c — j^ollimsup \an\^^" — \x — xo\R. 

n— >oo n— >oo 

In particular if = oo, then L = oo for any x ^ xq, and the series diverges by the root test. On the 
other hand if i? = 0, then L = for any x, and the series converges absolutely for all x. 

So suppose that <R <oo. The series converges absolutely if 1 > L = R\x — xq\ or in other 
words when 

|x — JCol < V^- 
The series diverges when 1 < L — R\x — xo\ or 

— j:o| > V^- 

Letting p = i/i? completes the proof. □ 

It may be useful to restate what we have learned in the proof as a separate proposition. 
Proposition 2.6.10. Let £a„(jc — jcq)" a power series, and let 



R := limsup |a„ 



l/n 



n— >oo 



If R = oo, the power series is divergent. IfR = 0, then the power series converges everywhere. 
Otherwise the radius of convergence p = 

Usually radius of convergence is written as p = i/i? with the obvious understanding of what p 
should be if = or = oo. 

Convergent power series can be added and multiplied together, and multiplied by constants. The 
proposition has an easy proof using what we know about series in general, and power series in 
particular. We leave the proof to the reader. 

Proposition 2.6.11. Let Y.n=o'^n{x — xo)" and YZ=Q^n{x — xqY be two convergent power series 
with radius of convergence at least p > 0. Then for all x such that \x — xo\ < p we have 

oo \ / \ oo 

Y,an{x-Xo)" + J^Z7„(x-Xo)" = Y,i^n + bn){x-Xof, 
,n=0 I \n=Q I n=0 
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k«=0 / n=0 



and 



^n=0 / \n=0 / n=0 

w/iere c„ = oobn + aibn-i H h a„Z?o. 

That is, after performing the algebraic operations, the radius of convergence of the resulting 
series is at least p. Note that after applying an algebraic operation the radius of convergence could 
increase. See the exercises. 

Let us look at some examples of power series. Polynomials are simply finite power series. 
That is, a polynomial is a power series where the a„ are zero for all n large enough. We expand 
a polynomial as a power series about any point xq by writing the polynomial as a polynomial in 
{x — xq). For example, 2x^ — 3jc + 4 as a power series around xq = 1 is 

2;c2 - 3x + 4 = 3 + (x - 1) + 2(x - 1)^. 

We can also expand rational functions, that is, ratios of polynomials as power series, although 
we will not completely prove this fact here. Notice that a series that for a rational function only 
defines the function on an interval even if the function is defined elsewhere. For example, for the 
geometric series we have that forx G (—1,1) 

oo 

The series diverges when \x\ > 1, even though is defined for all x 7^ 1. 

We can use the geometric series together with rules for addition and multiplication of power 
series to expand rational functions as power series around xq, as long as the denominator is not 
zero at xq. We state without proof that this is always possible, and we give an example of such a 
computation using the geometric series. 



Example 2.6.12: Let us expand ^_^^_^^2 as a power series around the origin (xq = 0) and find the 
radius of convergence. 



92 



CHAPTER 2. SEQUENCES AND SERIES 



First, write 1 +2x + x^ = (1 +x)'^ = (l ~ ■ Next suppose that \x\ < 1. Compute 

/ 1 \ 2 
X I I 



l+2x + x^ \l-{-x) 




n=0 



where using the formula for the product of series we obtain, co = l, ci = — 1 — 1 = —2, C2 
1 + 1 + 1 = 3, etc Therefore we get that for |x| < 1 that 



X 



£(-l)-nr-. 



l+2x + x^ 

The radius of convergence is at least 1. We leave it to the reader to verify that the radius of 
convergence is exactly equal to 1 . 

You can use the method of partial fractions you know from calculus. For example, to find the 
power series for at 0, write 

4- r 11 °° °° 

^2^^ - = 1 (-i)V - E^". 

2.6.6 Exercises 

Exercise 2.6.1: Decide the convergence or divergence of the following series. 

«=1 ^ n=\ " fi=l " „=!(,« + 1J 

Exercise 2.6.2; Suppose that both Y!n=o '^n <^nd Y.n=o converge absolutely. Show that the product series, 
Lr=o where c„ = a^bn H \- a^bo, also converges absolutely. 

Exercise 2.6.3 (Challenging): Let ^ a„ be conditionally convergent. Show that given any number x there 
exists a rearrangement ofY^cin such that the rearranged series converges to x. Hint: See Example 2.6.4. 

Exercise 2.6.4: a ) Let us show that the alternating harmonic series ^ — — has a rearrangement such that 
for any x <y, there exists a partial sum Sfi of the rearranged series such that x ^ Sfi <i y. b) Show that the 
rearrangement you found does not converge. See Example 2.6.4. c) Show that for any x G M, there exists a 
subsequence of partial sums {sj^} of your rearrangement such that lim = x. 
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Exercise 2.6.5; For the following power series, find if they are convergent or not, and if so find their radius 
of convergence. 

oo oo oo oo I oo oo 

a)Y^2V b)Y^nx'^ c)Y^n\x" (^-10)" e)Y^x^" f) l^nlx"'- 

n=0 n=Q n=Q n=0 y'^'^l ' n=0 n=Q 

Exercise 2.6.6; Suppose thatY,ci„x" converges for x = \. a) What can you say about the radius of conver- 
gence? b) If you further know that at x = I the convergence is not absolute, what can you say? 

X 

Exercise 2.6.7; Expand ^ as a power series around xq = and compute its radius of convergence. 

Exercise 2.6.8; a) Find an example where the radius of convergence of^cin^" md J^b„x" are 1, but the 
radius of convergence of the sum of the two series is infinite, b) (Trickier) Find an example where the radius 
of convergence ofY,anX" and Y.t'n^" cif^ I, b^t the radius of convergence of the product of the two series is 
infinite. 

Exercise 2.6.9; Figure out how to compute the radius of convergence using the ratio test. That is, suppose 
that Vanx" is a power series andR := lim ^"^"^i' exists or is oo. Find the radius of convergence and prove your 
claim. 

Exercise 2.6.10; a) Prove that lim n^/" = 1. Hint: Write = \+bn and note bn > 0. Then show that 
(1 +b„)" > "^"2 b^ and use this to show that lim b„ = 0. b) Use the result of part a) to show that ifY,<^nx" 
is a convergent power series with radius of convergence R, then J^nanx'^ is also convergent with the same 
radius of convergence. 

There are different notions of summability (convergence) of a series than just the one we have seen. A 
common one is Cesdro summability*. Let be a series and let s,^ be the nth partial sum. The series is said 
to be Cesaro summable to a if 

,. Si+S2-\ \-Sn 

a = lim . 

Exercise 2.6.11 (Challenging); a) IfY^^n is convergent to a (in the usual sense), show thatY,ci„ is Cesdro 
summable to a. b) Show that in the sense of Cesdro L ( — 1)" is summable to 1/2. c) Let a„ := k when n = k^ 
for some ^ € N, otherwise let an := 0. Show that Y.a„ diverges (in the usual sense, partial sums grow without 
bound), but it is Cesdro summable to (seems a little paradoxical at first sight). 



*Named for the Italian mathematician Ernesto Cesaro (1859 - 1906). 
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Chapter 3 

Continuous Functions 



3.1 Limits of functions 

Note: 2-3 lectures 

Before we define continuity of functions, we need to visit a somewhat more general notion of a 
limit. That is, given a function / : 5 — > R, we want to see how f{x) behaves as x tends to a certain 
point. 

3.1.1 Cluster points 

First, let us return to a concept we have previously seen in an exercise. 

Definition 3.1.1. Let S C M be a set. A number x e R is called a cluster point of S if for every 
e > 0, the set (x — e, X + e) n 5 \ {x} is not empty. 

That is, X is a cluster point of S if there are points of S arbitrarily close to x. Another way of 
phrasing the definition is to say that x is a cluster point of S if for every e > 0, there exists ay E S 
such that y^x and \x — y\ < e. Note that a cluster point of S need not lie in S. 

Let us see some examples. 

(i) The set {i/n : n e N} has a unique cluster point zero. 

(ii) The cluster points of the open interval (0, 1) are all points in the closed interval [0, 1]. 

(iii) For the set Q, the set of cluster points is the whole real line R. 

(iv) For the set [0, 1) U {2}, the set of cluster points is the interval [0, 1]. 

(v) The set N has no cluster points in R. 

Proposition 3.1.2. Let 5 C M. Then x eM. is a cluster point of S if and only if there exists a 
convergent sequence of numbers {x„} such that Xn ^ x, x^ e S, and lim Xn — x. 
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Proof. First suppose that x is a cluster point of S. For any n G N, we pick x„ to be an arbitrary point 
of {x — i/m,x + 1/m) nS\ {x}, which we know is nonempty because jc is a cluster point of S. Then x„ 
is within i/n of x, that is, 

As {i/n} converges to zero, converges tox. 

On the other hand, if we start with a sequence of numbers {x„} in S converging to jc such that 
Xn^x for all n, then for every e > there is an M such that in particular |xm — < £■ That is, 

XM e (x-e,jc + e)n5\{jc}. □ 
3.1.2 Limits of functions 

If a function / is defined on a set S and c is a cluster point of 5, then we can define the limit of 
f{x) as X gets close to c. Do note that it is irrelevant for the definition if / is defined at c or not. 
Furthermore, even if the function is defined at c, the limit of the function as x goes to c could very 
well be different from /(c) . 

Definition 3.1.3. Let / : 5 — )> M be a function and c a cluster point of S. Suppose that there exists an 
L G M and for every e > 0, there exists a 5 > such that whenever jc G 5 \ {c} and \x — c\ < 5, then 

\f{x)-L\<E. 

In this case we say that f{x) converges to L as x goes to c. We also say that L is the limit of f{x) as 
X goes to c. We write 

Um/(jc) :=L, 

x-^c 

or 

f{x) — )■ L as x — )■ c. 
If no such L exists, then we say that the limit does not exist or that / diverges at c. 

Again the notation and language we are using above assumes that the limit is unique even though 
we have not yet proved that. Let us do that now. 

Proposition 3.1.4. Let c be a cluster point ofSdM. and let f: S ^Mbe a function such that f{x) 
converges as x goes to c. Then the limit of f{x) as x goes to c is unique. 

Proof. Let L\ and L2 be two numbers that both satisfy the definition. Take an e > and find a 

5i > such that \ f{x) - Li | < e/2 for all x G S \ {c} with \x-c\ < 5i . Also find ^2 > such that 
\f{x) -L2I < e/2 for all X G 5\{c} with \x-c\ < &i. Put 5 := min{5i,52}. Suppose thatx G S, 
\x — c\<5, and x^ c. Then 

|Li -L2I = |Li -/(x) +/(x) -L2I < |Li -/(x)| + |/(x) -L2I < I + I = 



□ 
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Example 3.1.5: Let / : M ^> M be defined as f{x) := x^. Then 

lim f(x) = limx^ = c^. 

Proof: First let c be fixed. Let e > be given. Take 

d :— min <( 1, 



2|c| + l 



Take x^c such that |jc — c| < 5. In particular, \x — c\ < 1. Then by reverse triangle inequality we 
get 

\x\ — |c| < |x — c| < 1. 
Adding 2 |c| to both sides we obtain |x| + |c| < 2 |c| + 1. We compute 

|/(^) 



21 12 21 

c \ = \x — c 



^\{x + c){x-c)\ 
= \x + c\ \x — c\ 

< {\x\ + \c\) \x — c\ 

< {2\c\ + 1) \x-c\ 



Example 3.1.6: Define / : [0, 1) ^ R by 
Then 



X if ;c > 0, 
1 ifjc = 0. 



lim/(x)=0, 

even though /(O) = 1. 

Proof: Let e > be given. Let 5 := e. Then for x E [0, 1), x 7^ 0, and |x — 0| < 5 we get 

|/(jc)-0| = |jc| < 5 = e. 



3.1.3 Sequential limits 

Let us connect the limit as defined above with limits of sequences. 

Lemma 3.1.7. Let 5 C R and c be a cluster point ofS. Let f: S ^Rbe a function. 

Then f{x) L as x ^ c, if and only if for every sequence {xn} of numbers such thatXn € 5'\ {c} 
for all n, and such that \imxn — c, we have that the sequence {f{xn)} converges to L. 
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Proof. Suppose that f{x) — > L as x — )■ c, and {jc„} is a sequence such that x„ e 5* \ {c} and lim Xn = c. 
We wish to show that {f{xn)} converges to L. Let e > be given. Find a 5 > such that if x G 5 \ {c} 
and \x — c\ < 5, then \ f{x) — L\ < £. As {xn} converges to c, find an M such that for n> M we 
have that — c| < 5. Therefore 

\f{Xn)-L\ <£. 

Thus {f{xn)} converges toL. 

For the other direction, we use proof by contrapositive. Suppose that it is not true that f{x) — L 
as X — )■ c. The negation of the definition is that there exists an e > such that for every 5 > there 
exists an jc G 5 \ {c}, where \x — c\<8 and \f{x) — L\ >£. 

Let us use i/n for d in the above statement to construct a sequence {x,,}. We have that there 
exists an e > such that for every n, there exists a point jc„ G 5 \ {c}, where \xn — c\ < i/« and 
\f{xn) —L\ >£. The sequence {xn} just constructed converges to c, but the sequence {f{xn)} does 
not converge to L. And we are done. □ 

It is possible to strengthen the reverse direction of the lemma by simply stating that {f{xn)} 
converges without requiring a specific limit. See Exercise 3.1.11. 

Example 3.1.8: lim sin(i/x) does not exist, but lim jcsin(i/x) = 0. See Figure 3.1. 




Figure 3.1: Graphs of sin(i/;i:) and xsin(i/x). Note that the computer cannot properly graph sin(i/x) 
near zero as it oscillates too fast. 

Proof: Let us work with sin(i/.v) first. Let us define the sequence Xn := It is not hard to 

see that lim Xn = 0. Furthermore, 

sin(i/x„) = sin(;rn + ^2) = (-!)"• 
Therefore, {sin(i/x„)} does not converge. Thus, by Lemma 3.1.7, lim;c^o sin(i/.i;) does not exist. 
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Now let us look at xsin(i/x). Let x„ be a sequence such that x,i ^ for all n and such that 
Um;c„ = 0. Notice that |sin(?) | < 1 for any ? G M. Therefore, 

|x„sin(i/x„) -0| = \xn\ |sin(i/x„)| < \x„\. 

As Xn goes to 0, then \x„\ goes to zero, and hence {x^ sin(i/x„)} converges to zero. By Lemma 3.1.7, 
lim xsin(i/.Y) = 0. 

Using Lemma 3.1.7, we can start applying everything we know about sequential limits to limits 
of functions. Let us give a few important examples. 

Corollary 3.1.9. Let 5 C M and c be a cluster point ofS. Let / : 5 — )■ M and g: S ^M. be functions. 
Suppose that the limits of f{x) and g{x) as x goes to c both exist, and that 

f{x) < g{x) for all X E S. 

Then 

Mm fix) < lim^(.x:). 

x^c x^c 

Proof. Take {xn} be a sequence of numbers in 5 \ {c} that converges to c. Let 

Li:=lim/(x), and L2 := lim^(.x:). 

x^c x^c 

By Lemma 3.1.7 we know {f{xn)} converges to Li and {g{xn)} converges to L2. We also have 
f{xn) < gi^n)- We obtain Li < L2 using Lemma 2.2.3. □ 

By applying constant functions, we get the following corollary. The proof is left as an exercise. 

Corollary 3.1.10. Let 5 C M and c be a cluster point ofS. Let f: S ^M.be a function. And suppose 
that the limit of f{x) as x goes to c exists. Suppose that there are two real numbers a and b such that 

^ < f{x) < b for all X E S. 

Then 

a < \imf{x) < b. 

Using Lemma 3.1.7 in the same way as above we also get the following corollaries, whose 
proofs are again left as an exercise. 

Corollary 3.1.11. Let 5 C M and c be a cluster point ofS. Let f: 5 — )■ M, g : 5 — M, and h: 5 — )• M 
be functions. Suppose that 

/W < g{x) < h{x) for all x E S, 
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and the limits of f{x) and h{x) as x goes to c both exist, and 

lim f(x) — Ximhix). 

Then the limit ofg{x) as x goes to c exists and 

limg(jc) = lim/(;c) = lim/i(jc). 

Corollary 3.1.12. Let 5 C M and c be a cluster point ofS. Let / : 5 ^ M and g: S ^M. be functions. 
Suppose that limits of f{x) and g{x) as x goes to c both exist. Then 



3.1.4 Limits of restrictions and one-sided limits 

It is not necessary to always consider all of S. Sometimes we may be able to work with the function 
defined on a smaller set. 

Definition 3.1.13. Let /: S ^ M be a function. Let A c S. Define the function /|a : A ^ M by 



The function /|a is called the restriction of / to A. 

The function /|a is simply the function / taken on a smaller domain. The following proposition 
is the analogue of taking a tail of a sequence. 

Proposition 3.1.14. Let S C M, c G M, and let f: 5 — )■ M Z^e a function. Suppose A <Z S is such that 
there is some a > such that A n (c — a, c + Of) = 5 fi (c — a, c + a). 

( i) The point c is a cluster point of A if and only ifc is a cluster point ofS. 

( ii) Supposing c is a cluster point ofS, then f{x) Lasx ^ c if and only iff\A{x) L as x ^ c. 





f{x) _ \imx^cf{x) 



x^cg{x) \ivax-^cg{x)' 



f\A{x) := f{x) 



forx e A. 
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Proof. First, let c be a cluster point of A. Since A (Z S, then if (A \ {c}) n (c — £, c + e) is nonempty 
for every e > 0, then (5 \ {c}) fi (c — £, c + e) is nonempty for every e > 0. Thus c is a cluster point 
of S. Second, suppose that c is a cluster point of S. Then for £ > such that e < a we get that 
(A\ {c}) n (c-£,c + £) = (5\ {c}) n (c-e,c + £), which is nonempty. This is true for all £ < a 
and hence (A \ {c}) fl (c — £, c + £) must be nonempty for all £ > 0. Thus c is a cluster point of A. 

Now suppose that f{x) L as x ^ c. That is, for every £ > there is a 5 > such that if 
xE S\ {c} and \x — c\ < 5, then \f{x) — L| < £. As A C 5, then if x is inA\ {c}, then x is in 5\ {c}, 
and hence /|a('^) L as x ^ c. 

Finally suppose that /Ul-^) L as x ^ c. Hence for every £ > there is a 5 > such that if 
X eA \ {c} and \x — c\ < 5, then j/Ul-^) ~ L\ < £. Without loss of generality assume 5 < a. If 
\x-c\< 5, thenx G 5\{c} if and only if ;c G A \ {c}. Thus \ f{x) -L\ = |/UW -L\ <£. □ 

A common use of restriction with respect to limits are one-sided limits. 

Definition 3.1.15. Let / : 5 — )• M be function and let c be a cluster point of 5 fl (c, oo) . Then if the 
hmit of the restriction of f to SO (c, oo) as a: — )■ c exists, we define 

]im f{x) := lmi/|5n(c.oo) W. 

Similarly if c is a cluster point of 5 fl (— o°, c) and the limit of the restriction as x c exists, we 
define 

lim f{x) :=lim/|5n(-oo,c)W- 

Proposition 3.1.16. Let S C M. be a set such that c is a cluster point of both 5n (— oo, c) and 

S n (c, oo), and let f: 5 — t- M a function. Then 

\imf{x) = L if and only if lim f{x) = lim f{x) = L. 

That is, a limit exists if both one-sided limits exist and are equal, and vice-versa. The proof is a 
straightforward application of the definition of limit and is left as an exercise. The key point is that 

(5n(-oo,c))u(5n(c,oo))=5\{c}. 

3.1.5 Exercises 

Exercise 3.1.1: Find the limit or prove that the limit does not exist 
a)]im^/x,for c>0 b)Yimx^ +x+\,for any c E^ cj Iimx^cos(i/A:) 

X— x^c jc— )-0 

d) lim sin(i/.Y)cos(i/x) e) lim sin(;t:) cos(i/a-) 

x^Q x^O 

Exercise 3.1.2: Prove Corollary 3.L10. 



Exercise 3.1.3: Prove Corollary 3.L1L 



102 



CHAPTER 3. CONTINUOUS FUNCTIONS 



Exercise 3.1.4: Prove Corollary 3.1.12. 

Exercise 3.1.5; Let A C S. Show that if c is a cluster point of A, then c is a cluster point of S. Note the 
difference from Proposition 3.1.14. 

Exercise 3.1.6; Let A C S. Suppose that c is a cluster point of A and it is also a cluster point of S. Let 
/ : S ^Mbe a function. Show that if f{x) L as x ^ c, then /|^ (x) -^Las x ^ c. Note the difference from 
Proposition 3.1.14. 

Exercise 3.1.7; Find an example of a function /: [—1,1] — )• M such that for A := [0,1], the restriction 
/Ia(-x) — )• fli' X — )• 0, but the limit of f{x) as x ^ does not exist Note why you cannot apply Proposi- 
tion 3.1.14. 

Exercise 3.1.8; Find example functions f and g such that the limit of neither f{x) nor g{x) exists as x ^0, 
but such that the limit of f{x) +g(x) exists as x^Q. 

Exercise 3.1.9; Let C[ be a cluster point of A C M and C2 be a cluster point ofB C M. Suppose that f:A^B 
and g : B — )• M are functions such that f{x) C2 as x ^ c\ and g{y) L as y ^ cj. Let h{x) := g (/(x)) 
and show h{x) — )• L as x — )• ci. 

Exercise 3.1.10; Let c be a cluster point of A C M, and f: A be a function. Suppose that for every 
sequence {x„} in A, such that limxn = c, the sequence {/(x„)}~^j is Cauchy. Prove that lim,v^c/(-^) exists. 

Exercise 3.1.11; Prove the following stronger version of one direction of Lemma 3.1. 7: Let S CM, c be a 
cluster point ofS, and / : S be a function. Suppose that for every sequence {x„} in S \ {c} such that 
lim x„ = c the sequence {/(x„)} is convergent. Then show f{x) Las x^ cfor some L G M. 

Exercise 3.1.12; Prove Proposition 3.1.16. 
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3.2 Continuous functions 

Note: 2-2.5 lectures 

You have undoubtedly heard of continuous functions in your schooling. A high school criterion 
for this concept is that a function is continuous if we can draw its graph without lifting the pen from 
the paper. While that intuitive concept may be useful in simple situations, we require a rigorous 
concept. The following definition took three great mathematicians (Bolzano, Cauchy, and finally 
Weierstrass) to get correctly and its final form dates only to the late 1800s. 

3.2.1 Definition and basic properties 

Definition 3.2.1. Let 5 C M, c G 5, and let / : 5 — )■ M be a function. We say that / is continuous at c 
if for every e > there is a 5 > such that whenever x E S and \x — c\ < d, then \ f{x) — f{c) \ < e. 
When /: 5 — > M is continuous at all c E S, then we simply say that / is a continuous function. 

Sometimes we say that / is continuous on A C 5. We then mean that / is continuous at all c eA. 
It is left as an exercise to prove that if / is continuous on A, then /|^ is continuous. 









































1 



Figure 3.2: For \x — c\ < 5, f{x) should be within the gray region. 

Continuity may be the most important definition to understand in analysis, and it is not an easy 
one. See Figure 3.2. Note that d not only depends on e, but also on c; we need not have to pick one 
8 for all c G 5. It is no accident that the definition of continuity is similar to the definition of a limit 
of a function. The main feature of continuous functions is that these are precisely the functions that 
behave nicely with limits. 

Proposition 3.2.2. Suppose that f: S ^M.is a function and c E S. Then 

(i) Ifc is not a cluster point ofS, then f is continuous at c. 

(ii) If c is a cluster point ofS, then f is continuous at c if and only if the limit of f{x) as x ^ c 
exists and 

lim/W=/(c). 

(Hi) f is continuous at c if and only if for every sequence {xn} where Xn E S and limx„ = c, the 
sequence converges to f{c). 
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Proof. Let us start with the first item. Suppose that c is not a cluster point of S. Then there exists 
a 5 > such that 5n(c — 5,c + 5) = {c}. Therefore, for any e > 0, simply pick this given delta. 
The only xeS such that \x- c\ < 5 is x = c. Then \f{x) - f{c) \ = |/(c) - /(c) | = < £. 

Let us move to the second item. Suppose that c is a cluster point of S. Let us first suppose that 
\im^^cf{x) = /(c). Then for every £ > there is a 5 > such that if x E S\{c} and \x — c\ < 5, 
then \ f{x) — f{c) | < £. As |/(c) — /(c) | = < £, then the definition of continuity at c is satisfied. 
On the other hand, suppose that / is continuous at c. For every £ > 0, there exists a 5 > such that 
iox X eS where |x — c| < 5 we have \ f{x) — f{c) \ < £. Then the statement is, of course, still true if 
X e S\{c} C S. Therefore limx^c/(-^) = /(c). 

For the third item, suppose that / is continuous at c. Let {xn} be a sequence such that Xn E S 
and lim Xn = c. Let £ > be given. Find a 5 > such that \ f{x) — f{c) \ < £ for all x G 5 where 
\x — c\ < d. Find an M G N such that for n > M we have \x„ — c\<d. Then for n > M we have that 
\f{xn)-f{c) \ < £, SO {f{xn)} couvcrgcs to /(c). 

Let us prove the converse of the third item by contrapositive. Suppose that / is not continuous at 
c. Then there exists an £ > such that for all 5 > 0, there exists an x G 5 such that \x — c\ < 5 and 
\f{x) — /(c) I > £. Let us define a sequence {xn} as follows. Let XnE She such that \xn — c| < i/n 
and \f{xn) —/(c) I > £. Now {x„} is a sequence of numbers in S such that limx„ = c and such 
that \f{xn) — f{c) I > £ for all n eN. Thus {f{x„)} does not converge to /(c). It may or may not 
converge, but it definitely does not converge to /(c) . □ 

The last item in the proposition is particularly powerful. It allows us to quickly apply what we 
know about limits of sequences to continuous functions and even to prove that certain functions are 
continuous. It can also be strengthened, see Exercise 3.2.13. 

Example 3.2.3: /: (0,°°) — t- M defined by f{x) := i/x is continuous. 

Proof: Fix c G (0,oo). Let {jc„} be a sequence in (0,°°) such that lim;c„ = c. Then we know that 

lim f{x„) = lim — = = -= f{c). 

n^oo n^'^Xn limx,, C 

Thus / is continuous at c. As / is continuous at all c G (0, oo), / is continuous. 

We have previously shown that lim^^c^^ = c^ directly. Therefore the function x^ is continuous. 
We can use the continuity of algebraic operations with respect to limits of sequences, which we 
proved in the previous chapter, to prove a much more general result. 

Proposition 3.2.4. Let / : M — )■ M a polynomial. That is 

fix) = a^x'^ -\- a^^ix'^^^ -\ hflix + ao, 



for some constants ao, ai , . . • , aj. Then f is continuous. 
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Proof. Fix c G M. Let {jc„} be a sequence such that limx„ = c. Then 

\imf{xn) = lim (adxi + ad-ixi^^-\ \-aiXn + ao 

= a^(limA:„)'^ + a^_i(limx,J'^^^ H hai(limx,j) +ao 

= adc'^ + ad-ic^^^ H haic + ao = /(c). 

Thus / is continuous at c. As / is continuous at all c G M, / is continuous. 



□ 



By similar reasoning, or by appealing to Corollary 3.1.12, we can prove the following. The 
details of the proof are left as an exercise. 

Proposition 3.2.5. Let f: 5 — t- M and g: 5 — t- M be functions continuous at c E S. 



( i) The function /? : 5 — )■ M defined by h{x) 

( ii) The function h: S ^M. defined by h{x) 
(Hi) The function h: S ^M. defined by h{x) 



f{x) + g{x) is continuous at c. 
f{x) — g{x) is continuous at c. 
f{x)g{x) is continuous at c. 



( iv) Ifg{x) 7^ Ofor all x E S, the function h: S -^M. defined by h{x) :- 



M 



IS continuous at c. 



Example 3.2.6: The functions sin(;c) and cos(x) are continuous. In the following computations 
we use the sum-to-product trigonometric identities. We also use the simple facts that |sin(x)| < \x\, 
\cos{x) \ < 1, and |sin(x)| < 1. 



|sin(x) — sin(c) 



2 sin 



= 2 
<2 
<2 



sm 

sin 

x — c 



x — c 

~Y~ 

x — c 

~Y~ 

x — c 



COS 



COS 



x-\-c 
x-\-c 



cosU 



' COS c 



2 

-2 sin 

sin 
sin 



2 

x — c 
x — c 



sm 



sm 



x-\-c 
x + c 



= 2 
<2 
<2 

The claim that sin and cos are continuous follows by taking an arbitrary sequence converg- 
ing to c. Details are left to the reader. 
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3.2.2 Composition of continuous functions 

You have probably already realized that one of the basic tools in constructing complicated functions 
out of simple ones is composition. A useful property of continuous functions is that compositions of 
continuous functions are again continuous. Recall that for two functions / and g, the composition 
fog is defined by {fog){x) ■=f[g{x)). 

Proposition 3.2.7. Let A, 5 C M and / : 5 — t- M and g: A ^ B be functions. If g is continuous at 
c G A and f is continuous at g{c), then f o g : A ^ M. is continuous at c. 

Proof. Let be a sequence in A such that lim.x:„ = c. Then as g is continuous at c, then {g{xn)} 
converges to g{c). As / is continuous at g{c), then {f{gixn))} converges to /(g(c)). Thus fog is 
continuous at c. □ 

Example 3.2.8: Claim: (sin(i/.Y))^ is a continuous function on (0,oo). 

Proof: First note that i/x is a continuous function on (0,°°) and sin(jc) is a continuous function 
on (0,oo) (actually on all of M, but (0,°°) is the range for i/x). Hence the composition sin(i/A:) is 
continuous. We also know that is continuous on the interval (—1,1) (the range of sin). Thus the 
composition (sin(i/.T)) is also continuous on (0,°°). 

3.2.3 Discontinuous functions 

When / is not continuous at c, we say that / is discontinuous at c, or that it has a discontinuity at c. 
If we state the contrapositive of the third item of Proposition 3.2.2 as a separate claim we get an 
easy to use test for discontinuities. 

Proposition 3.2.9. Let f: S ^M.be a function. Suppose that for some c G 5, there exists a sequence 
{xn}, Xn G 5, and lim Xn = c such that {f{xn)} does not converge to f{c) (or does not converge at 
all), then f is not continuous at c. 



is not continuous at 0. 

Proof: Take the sequence { — i/;?}. Then /( — V") = ~1 so lim f{—^/n) = — 1, but /(O) = 1. 
Example 3.2.11: For an extreme example we take the so-called Dirichlet function. 



Example 3.2.10: The function / : M ^ M defined by 





1 if X is rational, 
if X is irrational. 
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The function / is discontinuous at all c G M. 

Proof: Suppose that c is rational. Take a sequence {x„} of irrational numbers such that lim jc„ = c 
(why can we?). Then f{xn) = and so lim /(jc„) = 0, but /(c) = 1. If c is irrational, take a sequence 
of rational numbers {xn} that converges to c (why can we?). Then lim /(x„) — 1, but /(c) = 0. 

As a final example, let us yet again test the limits of your intuition. Can there exist a function 
that is continuous on all irrational numbers, but discontinuous at all rational numbers? There are 
rational numbers arbitrarily close to any irrational number. Perhaps strangely, the answer is yes. 
The following example is called the Thomae junctiotf_ov the popcorn function. 

Example 3.2.12: Let / : (0, 1) ^ M be defined by 




i/fc if X = in/k where m, G N and m and k have no common divisors, 
if X is irrational. 



Then / is continuous at all irrational c G (0, 1) and discontinuous at all rational c. See the graph of 
/ in Figure 3.3. 



Figure 3.3: Graph of the "popcorn function." 



Proof: Suppose that c = '^/k is rational. Take a sequence of irrational numbers such that 
limx^ = c. Then lim f{xn) = limO = 0, but /(c) = ^/k 7^ 0. So / is discontinuous at c. 

Now let c be irrational, so /(c) = 0. Take a sequence {xn} of rational numbers in (0, 1) such that 
lim Xn = c. Given e > 0, find ^ G N such that i/a: < e by the Archimedean property. If m/k G (0, 1) 
is lowest terms (no common divisors), then m < k. So there are only finitely many rational numbers 
in (0, 1) whose denominator k in lowest terms is less than K. Hence there is an M such that for 
n>M, all the rational numbers Xn have a denominator larger than or equal to K. Thus for n>M 

\f{Xn)-0\=f{Xn)<^/K<£. 

Therefore / is continuous at irrational c. 



* Named after the German mathematician Johannes Karl Thomae (1840 - 1921). 
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3.2.4 Exercises 

Exercise 3.2.1; Using the definition of continuity directly prove that / : M — )■ M defined by f{x) := is 
continuous. 

Exercise 3.2.2: Using the definition of continuity directly prove that f: (0,oo) — )• M defined by f{x) := i/x is 
continuous. 

Exercise 3.2.3: Let f : 

I X if X is rational, 
I x^ if X is irrational. 

Using the definition of continuity directly prove that f is continuous at 1 and discontinuous at 1. 
Exercise 3.2.4: Let f: R^Rbe defined by 






Is f continuous? Prove your assertion. 
Exercise 3.2.5: Let f: M — )• M Zj^ defined by 

m := 

Is f continuous? Prove your assertion. 
Exercise 3.2.6: Prove Proposition 3.2.5. 

Exercise 3.2.7: Prove the following statement. Let 5 C M and A <Z S. Let / : S ^Mbe a continuous function. 
Then the restriction /|a is continuous. 

Exercise 3.2.8: Suppose that 5 C M. Suppose that for some c G M and a>0,we have A = (c — (X,c + a) C S. 
Let f: S be a function. Prove that if f\A is continuous at c, then f is continuous at c. 

Exercise 3.2.9: Give an example of functions / : M — J- R and g : M — )■ M such that the function h defined by 
h{x) := f{x) +g{x) is continuous, but f and g are not continuous. Can you find f and g that are nowhere 
continuous, but h is a continuous function? 

Exercise 3.2.10: Let / : M — )• M and g: M — t- M Z?e continuous functions. Suppose that for all rational 
numbers r, f{r) = g{r). Show that f{x) = g{x)for all x. 

Exercise 3.2.11: Let f: M — )• M &e continuous. Suppose that f{c) > 0. Show that there exists an a > such 
that for allx € {c — a,c + a) we have f{x) > 0. 

Exercise 3.2.12: Let f: Z ^M. be a function. Show that f is continuous. 

Exercise 3.2.13: Let / : S ^Mbe a function and c € 5, such that for every sequence {x„} in S with lim x^ = c, 
the sequence {/(^n)} converges. Show that f is continuous at c. 
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3.3 Min-max and intermediate value theorems 

Note: 1.5 lectures 

Let us now state and prove some very important results about continuous functions defined on 
the real line. In particular, on closed bounded intervals of the real line. 

3.3.1 Min-max theorem 

Recall that a function / : [a, Z?] — )► M is bounded if there exists a 5 G M such that \ f{x) \ < B for all 
X E [a,b\. We have the following lemma. 

Lemma 3.3.1. Let f: [a, b] ^"Kbe a continuous function. Then f is bounded. 

Proof. Let us prove this claim by contrapositive. Suppose that / is not bounded, then for each 
n eN, there is an x„ G [a,b], such that 

\fixn) \ >n. 

Now {xn} is a bounded sequence as a < Xn < b. By the Bolzano- Weierstrass theorem, there is a 
convergent subsequence {jc„,.}. Let x := lim.x;„, . Since a < x^ < b for all i, then a < x < b. The 
limit lim f{xni) does not exist as the sequence is not bounded as |/(x„,) \>ni> i. On the other hand 
f{x) is a finite number and 

/W = /( limx,,, I . 

Thus / is not continuous at x. □ 

In fact, for a continuous /, we will see that the minimum and the maximum are actually achieved. 
Recall from calculus that / : 5 — )■ M achieves an absolute minimum at c G 5 if 

fix) > f{c) for all xES. 

On the other hand, / achieves an absolute maximum at c G 5 if 

fix) < f{c) for all XES. 

We say that / achieves an absolute minimum or an absolute maximum on S if such a c G 5 exists. If 
5 is a closed and bounded interval, then a continuous / must have an absolute minimum and an 
absolute maximum on S. 

Theorem 3.3.2 (Minimum-maximum theorem). Let f: [a,b\ — )■ M a continuous function. Then 
f achieves both an absolute minimum and an absolute maximum on [a,b\. 
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Proof. We have shown that / is bounded by the lemma. Therefore, the set /( [a, b]) = {f{x) : x G 
[a, b] } has a supremum and an infimum. From what we know about suprema and infima, there exist 
sequences in the set f{[a,b]) that approach them. That is, there are sequences {f{xn)} and {f{yn)}, 
where x„,y„ are in [a,b], such that 

lim f{xn) = inf f{[a,b]) and lim f{y„) = sup/([a,Z7]). 

We are not done yet, we need to find where the minimum and the maxima are. The problem is that 
the sequences {x„} and {j„} need not converge. We know that {xn} and {y„} are bounded (their 
elements belong to a bounded interval [a,^?]). We apply the Bolzano- Weierstrass theorem. Hence 
there exist convergent subsequences {x„^] and {jm,}- Let 

X := Yrnxxm and y := lim 

Then as a < x,,, < b, we have that a <x <b. Similarly a <y < b, so x and y are in [a, b] . We apply 
that a limit of a subsequence is the same as the limit of the sequence, and we apply the continuity of 
/ to obtain 

inf f{[a,b]) = lim f{xn) = lim/(jc„,.) 

Similarly, 

supf{[a,b]) = lim/(m„) = lim 
Therefore, / achieves an absolute minimum at x and / achieves an absolute maximum at y. □ 

Example 3.3.3: The function f{x) := x^ -\-l defined on the interval [—1,2] achieves a minimum at 
x = when /(O) = 1 . It achieves a maximum at x = 2 where /(2) = 5. Do note that the domain of 
definition matters. If we instead took the domain to be [—10, 10], then x = 2 would no longer be a 
maximum of /. Instead the maximum would be achieved at either x= 10orjc = —10. 

Let us show by examples that the different hypotheses of the theorem are truly needed. 

Example 3.3.4: The function f{x) : = x, defined on the whole real line, achieves neither a minimum, 
nor a maximum. So it is important that we are looking at a bounded interval. 

Example 3.3.5: The function f{x) := i/x, defined on (0, 1) achieves neither a minimum, nor a 
maximum. The values of the function are unbounded as we approach 0. Also as we approach x=l, 
the values of the function approach 1, but f{x) > I for all x E (0, 1). There is no jc G (0, 1) such that 
f{x) = 1 . So it is important that we are looking at a closed interval. 

Example 3.3.6: Continuity is important. Define /: [0, 1] — > M by f{x) := i/x for jc > and let 
/(O) := 0. Then the function does not achieve a maximum. The problem is that the function is not 
continuous at 0. 



/ 



( limjc„,. ) = 
■( limy mi ) 



fiy)- 
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3.3.2 Bolzano's intermediate value theorem 

Bolzano's intermediate value theorem is one of the cornerstones of analysis. It is sometimes called 
only intermediate value theorem, or just Bolzano's theorem. To prove Bolzano's theorem we prove 
the following simpler lemma. 

Lemma 3.3.7. Let f: [a,b] -^R be a continuous function. Suppose that f(a) < and f{b) > 0. 
Then there exists a number c G {a,b) such that f{c) = 0. 

Proof. We define two sequences and {bn} inductively: 

(i) Let ai := a and bi := b. 

(ii) iff(^^^ > 0, let fl„+i := an and b^+i := 

(iii) If / (^f^) < 0, let a„+i := and bn+i := bn. 

From the definition of the two sequences it is obvious that if an < bn, then < b„-\-i. Thus by 
induction a„ < bn for all n. Furthermore, < a„+i and bn > bn+i for all n. Finally we notice that 

, bn 
t>n+\ —(^n+l — 2 ■ 

By induction we see that 

bn-an= 2n-T = ^^^'\b-a). 

As {an} and {bn} are monotone and bounded, they converge. Let c := lim a„ and d := lim bn. 
As an < b„ for all n, then c < d. Furthermore, as a„ is increasing and bn is decreasing, c is the 
supremum of a„ and d is the infimum of the bn. Thus d — c < bn — an for all /?. So 

|<i — c| = d — c < bn — an = 2^^"{b — a) 

for all n. As 2^^"{b — a) — )■ as n — )• oo, we see that c = d. By construction, for all n we have 

f{an) < and f{bn) > 0. 

We use the fact that lim a„ = lim bn = c and the continuity of / to take limits in those inequalities 
to get 

/(c) = lim f{an) < and f(c) = lim f{bn) > 0. 
As /(c) > and /(c) < 0, we conclude /(c) = 0. Obviously, a < c <b. □ 

Notice that the proof tells us how to find the c. The proof is not only useful for us pure 
mathematicians, but it is a useful idea in applied mathematics. 

Theorem 3.3.8 (Bolzano's intermediate value theorem). Let / : [a,b] —^M.be a continuous function. 
Suppose that there exists a y such that f{a) <y < fib) or f{a) > y > fib). Then there exists a 
c G (a,^) such that fic) =y. 



112 



CHAPTER 3. CONTINUOUS FUNCTIONS 



The theorem says that a continuous function on a closed interval achieves all the values between 
the values at the endpoints. 

Proof. If f{a) <y < f{b), then define g{x) := f{x) — y. Then we see that g{a) < and g{b) > 
and we can apply Lemma 3.3.7 to g. If g{c) = 0, then /(c) = y. 

Similarly if f{a) > y > f{b), then define g{x) :=y — f{x). Then again g{a) < and g{b) > 
and we can apply Lemma 3.3.7. Again if g{c) = 0, then /(c) =y. □ 

If a function is continuous, then the restriction to a subset is continuous. So if /: 5 — )■ M is 
continuous and [a,b] C S, then f\[a,b] ^l^o continuous. Hence, we generally apply the theorem to 
a function continuous on some large set S, but we restrict attention to an interval. 

Example 3.3.9: The polynomial f(x) :=x^ — 2x^ +x—l has a real root in (1, 2). We simply notice 
that /( 1 ) = — 1 and /(2) = 1 . Hence there must exist a point c G (1,2) such that /(c) = 0. To find 
a better approximation of the root we could follow the proof of Lemma 3.3.7. For example, next 
we would look at 1.5 and find that /(1. 5) = —0.625. Therefore, there is a root of the equation 
in (1.5,2). Next we look at 1.75 and note that /(1. 75) ~ —0.016. Hence there is a root of / in 
(1.75,2). Next we look at 1.875 and find that /(1. 875) ^ 0.44, thus there is a root in (1.75,1.875). 
We follow this procedure until we gain sufficient precision. 

The technique above is the simplest method of finding roots of polynomials. Finding roots of 
polynomials is perhaps the most common problem in applied mathematics. In general it is hard to 
do quickly, precisely and automatically. We can use the intermediate value theorem to find roots for 
any continuous function, not just a polynomial. 

There are better and faster methods of finding roots of equations, such as Newton's method. 
One advantage of the above method is its simplicity. The moment we find an initial interval where 
the intermediate value theorem applies, we are guaranteed to find a root up to a desired precision in 
finitely many steps. Furthermore, the method only requires a continuous function. 

The theorem guarantees at least one c such that /(c) = y, but there may be many different 
roots of the equation /(c) = y. If we follow the procedure of the proof, we are guaranteed to find 
approximations to one such root. We need to work harder to find any other roots. 

Polynomials of even degree may not have any real roots. For example, there is no real number x 
such that .x;^ + 1 = 0. Odd polynomials, on the other hand, always have at least one real root. 

Proposition 3.3.10. Let f{x) be a polynomial of odd degree. Then f has a real root. 

Proof. Suppose / is a polynomial of odd degree d. We write 



f{x) = ajx'^ + ad-ix'' ^ + 



■ + aix + ao, 



where a^ 7^ 0. We divide by to obtain a polynomial 



g{x) ■.= x'' + bd-ix''-' + - 



+ bix + bo, 
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where bk = "k/aa. Let us show that g{n) is positive for some large n e N. So 



bd-in'^^^ + --- + bin + bQ 



bd-in"^^^ + --- + bin + bo\ 



< 



< 



\bd-i\n'^-^ + --- + \bi\n + \bo\ 



\bd-i\n''-^ + ■ ■ ■ + \bi\n'^-^ + \bo\rt 



d-l 



.d-l 



{\bd-i\ + --- + \bi\ + \bo\) 



i\bd-i\ + ---+\bi\ + \bo\). 



Therefore 



bd-in''-^ + --- + bin + bo 
lim -j = 0. 



Thus there exists an M e N such that 

bd-iM^'-^ + '-'+biM + bo 



Md 



<1, 



which implies 



-(&rf_iM^-^ + --- + &iM+&o) 



Therefore g{M) > 0. 

Next we look at g{—n) for n e N. By a similar argument (exercise) we find that there exists 



some KeN such that bd-i{-K)'^~^ + --- + bi (-K) +bQ<K'^ and therefore g{-K) < (why?). 
In the proof make sure you use the fact that d is odd. In particular, if d is odd then {~n)^ — —[n^). 
We appeal to the intermediate value theorem, to find ace [—K^M] such that g{c) = 0. As 



M 

ad 



, we see that /(c) = 0, and the proof is done. 



□ 



Example 33.11: An interesting fact is that there do exist discontinuous functions that have the 
intermediate value property. The function 



sin(i/x) ifx^O, 







if jc = 0, 



is not continuous at 0, however, it has the intermediate value property. That is, for any a<b, and 
any y such that /(a) <y < f{b) or f{a) >y> f{b), there exists a c such that f{y) = c. Proof is 
left as an exercise. 
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3.3.3 Exercises 

Exercise 3.3.1; Find an example of a discontinuous function f: [0, 1] — > M where the intermediate value 
theorem fails. 

Exercise 3.3.2: Find an example of a bounded discontinuous function f: [0, 1] — > M that has neither an 
absolute minimum nor an absolute maximum. 

Exercise 3.3.3: Let f: (0, 1) —^Wbe a continuous function such that lim/(;c) = lim/(x) = 0. Show that f 
achieves either an absolute minimum or an absolute maximum on (0, 1) (but perhaps not both). 

Exercise 3.3.4: Let 

m ■■- 



sin(i/;c) ifx ^ 0, 
ifx = 0. 



Show that f has the intermediate value property. That is, for any a < b, if there exists a y such that 
f{a) <y < f{b) or f{d) >y> f{b), then there exists ac E {a, b) such that f{c) = y. 

Exercise 3.3.5: Suppose that g{x) is a polynomial of odd degree d such that 

g{x)=x'^ + bd-ix'^'^ + --- + bix + bo, 

for some real numbers bQ,b\ ,.. . ,hd-\- Show that there exists a G N such that g{—K) < 0. Hint: Make 
sure to use the fact that d is odd. You will have to use that {—n^ = 

Exercise 3.3.6: Suppose that g{x) is a polynomial of even degree d such that 

g{x) = x^ + bd-ix'^~^ + ■ ■ ■ + bix + bQ, 

for some real numbers bo,bi,..., b^-i. Suppose g{0) < 0. Show that g has at least two distinct real roots. 

Exercise 3.3.7: Suppose that f: [a, Z?] — )• M is a continuous function. Prove that the direct image f{[a, b]) is 
a closed and bounded interval or a single number. 

Exercise 3.3.8: Suppose that f: M — > M w continuous and periodic with period P>0. That is, f{x + P) = 
f{x) for all X G M. Show that f achieves an absolute minimum and an absolute maximum. 

Exercise 3.3.9 (Challenging): Suppose that f{x) is a bounded polynomial, that is, there is an M such that 
\f{x) I < Mfor all x G M. Prove that f must be a constant. 

Exercise 3.3.10: Suppose that f: [0, 1] — > [0, 1] is continuous. Show that f has a fixed point, that is, show 
that there exists anxE [0, 1] such that f{x) = x. 
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3.4 Uniform continuity 

Note: 1.5-2 lectures (Continuous extension and Lipschitz can be optional) 



3.4.1 Uniform continuity 

We made a fuss of saying that the 8 in the definition of continuity depended on the point c. There 
are situations when it is advantageous to have a 5 independent of any point. Let us give a name to 
this concept. 

Definition 3.4.1. Let 5 C M, and let / : S — )■ M be a function. Suppose that for any e > there 
exists a 5 > such that whenever x, c e 5 and \x — c\<5, then \f{x) — f{c) \ < e. Then we say / is 
uniformly continuous. 

It is not hard to see that a uniformly continuous function must be continuous. The only difference 
in the definitions is that for a given e > we pick a 5 > that works for all c e S. That is, 5 
can no longer depend on c, it only depends on £. The domain of definition of the function makes 
a difference now. A function that is not uniformly continuous on a larger set, may be uniformly 
continuous when restricted to a smaller set. 

Example 3.4.2: The function /: (0, 1) M, defined by f{x) := i/x is not uniformly continuous, 
but it is continuous. 

Proof: Given e > 0, then for e > | ^/x — to hold we must have 



\xy\ xy 
or 

\x—y\ < xye. 

Therefore, to satisfy the definition of uniform continuity we would have to have 5 < xys for all x,y 
in (0, 1), but that would mean that 5 < 0. Therefore there is no single 5 > 0. 

Example 3.4.3: /: [0, 1] — > R, defined by f{x) := x^ is uniformly continuous. 
Proof: Note that < x,c < 1. Then 

— c^l = |x + c| |x — c| < (|x| + |c|) |x — c| < (1 + 1) |jc — c| . 

Therefore given £ > 0, let 5 := e/2. If \x — c\ < 5, then \x^ — < e. 

On the other hand, / : M ^ M, defined by f{x) := jc^ is not uniformly continuous. 

Proof: Suppose it is, then for all e > 0, there would exist a 5 > such that if |x — c| < 5, then 
\x^ -c^\<e. Take x > and let c := x + S/2. Write 

e > |x^-c^| = |x + c| |x-c| = {2x + 8/2)8/2 > 5x. 
Therefore x < e/5 for all x > 0, which is a contradiction. 
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We have seen that if / is defined on an interval that is either not closed or not bounded, then / 
can be continuous, but not uniformly continuous. For a closed and bounded interval [a,b], we can, 
however, make the following statement. 

Theorem 3.4.4. Let f: [a, b] ^ M. be a continuous function. Then f is uniformly continuous. 

Proof. We prove the statement by contrapositive. Suppose that / is not uniformly continuous. We 
will prove that there is some c G [a,b] where / is not continuous. Let us negate the definition of 
uniformly continuous. There exists an e > such that for every 5 > 0, there exist points x,y mS 
with \x-y\< d and \f{x)-f{y) \ > e. 

So for the e > above, we can find sequences {xn} and {y,,} such that |jc„ —yn\ < V" ^^ch 
that \f{x„) — f{yn) \ > £• By Bolzano-Weierstrass, there exists a convergent subsequence {xn,^}. Let 
c := lim Xn,^. Note that as a < Xn^. < b, then a< c <b. Write 

As |c — x„J and ^/n^ go to zero when k goes to infinity, we see that } converges and the limit is 
c. We now show that / is not continuous at c. We estimate 

\f{c)-f{Xn,)\ = \fic)-fiyn,)+fiyn,)-fM\ 
>\f{yn,)-f{Xn,)\-\f{c)-f{yn,)\ 

>£-\f{c)-f{yn,)\. 

Or in other words 

\f{c)-f{x„,)\ + \f{c)-f{yn,)\>e. 

At least one of the sequences {f{xnj} or {/(jnj} cannot converge to /(c), otherwise the left hand 
side of the inequality would go to zero while the right-hand side is positive. Thus / cannot be 
continuous at c. □ 



3.4.2 Continuous extension 

Before we get to continuous extension, we show the following useful lemma. It says that uniformly 
continuous functions behave nicely with respect to Cauchy sequences. The new issue here is that 
for a Cauchy sequence we no longer know where the limit ends up; it may not end up in the domain 
of the function. 

Lemma 3.4.5. Let f: S be a uniformly continuous function. Let {xn} be a Cauchy sequence 
in S. Then {/(a:„)} is Cauchy. 

Proof. Let e > be given. Then there is a 5 > such that \ f{x) — f{y) \ < £ whenever \x — y\ < d. 
Now find an M G N such that for all n, > M we have |a:„ — I < 5 • Then for all n, > M we have 

\f{x„)-fixk)\<£. □ 
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An application of the above lemma is the following theorem. It says that a function on an open 
interval is uniformly continuous if and only if it can be extended to a continuous function on the 
closed interval. 

Theorem 3.4.6. A function f: {a,b) ^M. is uniformly continuous if and only if the limits 

La := lim/(jc) and := lim/(x) 

x^a x^b 

exist and the function / : [a, Z?] — t- M defined by 

(fix) ifxe{a,b), 
La ifx = a, 
Lb ifx = b, 

is continuous. 

Proof. One direction is not hard to prove. If / is continuous, then it is uniformly continuous by 
Theorem 3.4.4. As / is the restriction of / to {a,b), then / is also uniformly continuous (easy 
exercise). 

Now suppose that / is uniformly continuous. We must first show that the limits La and 
Lb exist. Let us concentrate on La. Take a sequence {xn} in {a,b) such that limx„ = a. The 
sequence is a Cauchy sequence and hence by Lemma 3.4.5, the sequence {/(x,,)} is Cauchy 
and therefore convergent. We have some number L\ := lim/(jc,j). Take another sequence {y„} 
in {a,b) such that limy,, = a. By the same reasoning we get L2 '.= lim/(y„). If we show that 
Li = L2, then the limit La — limx^a/(jc) exists. Let e > be given, find 5 > such that \x — y\<d 
implies \f{x)-f{y)\ < e/3. Find M G N such that for n > M we have \a-Xn\ < ^/2, |a-y„| < S/2, 
\f{xn) -Li\< e/3, and - L2I < e/3- Then for n > M we have 

l-^n — Jnl = \Xn—Cl + a—yn\ < \Xn — a \ + \a — yn\ < S/2 + S/2= 5. 

So 

|Li - L2 1 = |Li - f{Xn) + f{Xn) - f{yn) + - ^2 1 

< |Li - f{Xn) I + \f{Xn) - f{yn) \ + - L2 \ 

< e/3 + e/3 + e/3 = £. 

Therefore Li = L2. Thus La exists. To show that L^, exists is left as an exercise. 

Now that we know that the limits La and exist, we are done. If Xmvjc^afi^) exists, then 
Ymv^^afi^) exists (See Proposition 3.1.14). Similarly with L^. Hence / is continuous at a and b. 
And since / is continuous at c G (a, b), then / is continuous at c G (a, b) . □ 
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3.4.3 Lipschitz continuous functions 

Definition 3.4.7. Let / : 5 — )■ M be a function such that there exists a number K such that for all x 
and 3; in S we have 

\fix)-ny)\<K\x-y\. 
Then / is said to be Lipschitz continuous* . 

A large class of functions is Lipschitz continuous. Be careful, just as for uniformly continuous 
functions, the domain of definition of the function is important. See the examples below and the 
exercises. First we justify the use of the word continuous. 

Proposition 3.4.8. A Lipschitz continuous function is uniformly continuous. 

Proof. Let / : 5 -> M be a function and let ^ be a constant such that for all x,y in S we have 
\f{x)-f{y)\<K\x-y\. 

Let e > be given. Take 5 := ^/k. For any x and y in 5 such that \x~y \ < d we have that 



l/W <K\x~y\<Kd=K- = e. 



K 



Therefore / is uniformly continuous. 



□ 



We can interpret Lipschitz continuity geometrically. If / is a Lipschitz continuous function with 
some constant K. We rewrite the inequality to say that for x^y we have 



m-fiy) 



<K. 



The quantity 



m-fjy) ; 



x-y 



is the slope of the line between the points and {y,f{y)). Therefore, 



/ is Lipschitz continuous if and only if every line that intersects the graph of / in at least two 
distinct points has slope less than or equal to K. 

Example 3.4.9: The functions sm{x) and cos(x) are Lipschitz continuous. We have seen (Exam- 
ple 3.2.6) the following two inequalities. 

|sin(x) — sin(j)| < |.)c — y| and |cos(.)c) — cos(j)| < |.)c — j| . 



Hence sin and cos are Lipschitz continuous with ^ = 1 . 

Example 3.4.10: The function /: [l,oo) — )► M defined by f{x) := y/x is Lipschitz continuous. 
Proof: 



x-y 



\x-y\ 



x+y/y 



*Named after the German mathematician Rudolf Otto Sigismund Lipschitz (1832-1903). 
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As X > 1 and y > 1 , we see that 



< 



Therefore 



x-y 



On the other hand / : [0, oo) 
why: Suppose that we have 



— )• M defined by f{x) := ^/x is not Lipschitz continuous. Let us see 



-3^1 



for some K. Let y = to obtain y/x < Kx. If ^ > 0, then for x > we then get i/a: < y/x. This 
cannot possibly be true for all x> 0. Thus no such ^ > can exist and / is not Lipschitz continuous. 

The last example is a function that is uniformly continuous but not Lipschitz continuous. To 
see that y/x is uniformly continuous on [0,°°) note that it is uniformly continuous on [0, 1] by 
Theorem 3.4.4. It is also Lipschitz (and therefore uniformly continuous) on [1,°°). It is not hard 
(exercise) to show that this means that y/x is uniformly continuous on [0,oo). 



3.4.4 Exercises 

Exercise 3.4.1: Let f: S be uniformly continuous. Let A C S. Then the restriction /|a is uniformly 
continuous. 

Exercise 3.4.2: Let f: {a,b) -^Mbe a uniformly continuous function. Finish the proof of Theorem 3.4.6 by 
showing that the limit lim /(:<;) exists. 

Exercise 3.4.3: Show that f: (c,oo) Mfor some c > and defined by f{x) := i/x is Lipschitz continuous. 

Exercise 3.4.4: Show that f: — s- M defined by f{x) := i/x is not Lipschitz. continuous. 

Exercise 3.4.5: LetA,B be intervals. Let f: A — )• M and g: B ^Mbe uniformly continuous functions such 
that f{x) = g{x) for x £ ADB. Define the function h : AL)B^Mbyh{x) := f{x) ifx£Aandh{x) := g{x) if 
X £ B\A. a) Prove that ifACiB / 0, then h is uniformly continuous, b) Find an example where ACiB = (d 
and h is not even continuous. 

Exercise 3.4.6: Let f: M^Mbe a polynomial of degree d >2. Show that f is not Lipschitz continuous. 

Exercise 3.4.7: Let /: (0, 1) — ?■ M Z?e a bounded continuous function. Show that the function g{x) := 
x(l —x)f{x) is uniformly continuous. 

Exercise 3.4.8: Show that f: (0,°°) — t- M defined by f{x) := sin(i/A:) is not uniformly continuous. 

Exercise 3.4.9 (Challenging): Let f: Q — ?• M Z?e a uniformly continuous function. Show that there exists a 
uniformly continuous function / : M — t- M such that f{x) = f{x) for all x G Q. 
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3.5 Limits at infinity 



Note: less than 1 lecture (optional, can safely be omitted unless §3.6 or §5.5 is also covered) 
3.5.1 Limits at infinity 

Just like for sequences, a continuous variable can also approach infinity. Let us make this notion 
precise. 

Definition 3.5.1. We say oo is a cluster point of 5 C M if for every M G M, there exists an x e S such 
that x> M. Similarly -co is a cluster point of S if for every M G M, there exists an x G 5 such that 
x<M. 

Let / : 5 — )• M be a function and suppose that S has 0° as a cluster point. If there exists an L such 
that for every e > 0, there is an M G i? such that 



whenever x>M, then we say f{x) converges to L as x goes to 00. We call L the limit and write 



Alternatively we write f{x) — )■ L as x — )• 00. 

Similarly, if —00 is a cluster point of S and there exists an L such that for every e > 0, there is an 
M eR such that 



|/W-L|<e 

whenever x<M, then we say f{x) converges to Lslsx goes to —0°. We call L the limit and write 

lim f{x) := L. 



Alternatively we write f{x) Las x ^ —0°. 

Again we did cheat a little bit and said the limit. We leave it as an exercise to prove the following 
proposition. 

Proposition 3.5.2. The limit at 0° or —0° as defined above is unique if it exists. 
Example 3.5.3: Let/(x) := rJ^- Then 



|/W-L|<e 



lim f{x) := L. 



lim f{x) = 



and 



Hm f{x) = 0. 




< £. If X > M, then 
af for —00 is left to the 



3.5. LIMITS AT INFINITY 



121 



Example 3.5.4: Let f{x) := sin(;rjc). Then limx^oo/(x) does not exist. To prove this fact note that 
if jc = 2n + 1/2 for some n eN then f{x) = 1 , while if x = In + ^/i then f{x) = — 1 , so they cannot 
both be within a small £ of a single real number. 

We must be careful not to confuse continuous limits with limits of sequences. For f{x) = sin(;rjc) 
we could say 

lim f{n) = 0, but lim f{x) does not exist. 

Of course the notation is ambiguous. We are simply using the convention that n eN, while .x; G M. 
When the notation is not clear, it is good to explicitly mention where the variable lives, or what kind 
of limit are you using. 

There is a connection to limits of sequences, but we must take all sequences going to infinity, 
just as before in Lemma 3. 1 .7. 

Lemma 3.5.5. Suppose that / : S ^Ris a Junction, 00 is a cluster point ofS C M, and L G M. Then 

lim f{x) = L 

if and only if 

lim f{xn) = L 

for all sequences {xn } such that lim Xn = °°. 

The proposition holds for the limit as x — t- —0°. Its proof is almost identical and is left as an 
exercise. 

Proof First suppose that f{x) — > L as x — t- 0°. Given an e > 0, there exists an M such that for all 
;c > M we have \ f{x)—L\<e. Let {xn} be a sequence such that lim;c„ = 0°. Then there exists an 
such that for all n > we have jc„ > M. And thus |/(jc„) - L| < e. 

We prove the converse by contrapositive. Suppose that f{x) does not go to L as x — )► 00, Let us 
negate the statement. This means that there exists an e > such that for every M G N there exists 
an ;c > M, let us call it xm, such that \ f{xM) — ^| > £• Consider the sequence {xn}. It is clear that 
{f{xn) } does not converge to L. It remains to note that lim jc„ = 0° as x„ > n for all n. □ 

Using the lemma, we can again translate all the results about sequential limits into results 
about continuous limits as x goes to infinity. That is, we have almost immediate analogues of the 
corollaries in §3.1.3. We simply allow the cluster point c to be either 00 or —0° in addition to a real 
number. We leave it to the student to verify these statements. 

3.5.2 Infinite limit 

Just as for sequences, it is often convenient to distinguish certain divergent sequences, and talk 
about limits being infinite almost as if the limits existed. 



122 



CHAPTER 3. CONTINUOUS FUNCTIONS 



Definition 3.5.6. Let /: 5 — )■ M be a function and suppose that S has oo as a cluster point. We say 
that f{x) diverges to infinity as x goes to oo, if for every G M there exists an M G M such that 

f{x)>N 

whenever x> M. We write 

lim/(x) :=oo, 

or we say that f{x) — > oo as x — oo. 

A similar definition can be made for limits as x — > — oo or asx c for a finite c. Also similar 
definitions can be made for limits being — oo. Stating these definitions is left as an exercise. Note 
that sometimes converges to infinity is used. We can again use sequential limits, and an analogue of 
Lemma 3.1.7 is left as an exercise. 

Example 3.5.7: Let us show that lim;f^oo — °°- 
For jc > 1 we have 

l+x^ ^ x^ X 



l+x x+x 2 

Given N eR, take M = max{2A^+ 1, 1}. If x > M, then ;c > 1 and V2 > A^. So 

> - >M. 

l+x - 2 

3.5.3 Compositions 

Finally, just as for limits at finite numbers we can compose functions easily. 

Proposition 3.5.8. Suppose f : A ^ B, g : 5 — )■ C, A, 5, C C M, a G M U {—00, ooj is a cluster point 
of A, and G M U {—0°, 00 ]■ is a cluster point ofB. Suppose that 

lim f{x)=b and \img{y)=c 
for some c G M U {—00, ooj. Then 

limg(/W) =c. 

The proof is straightforward, and left as an exercise. Note that we already know the proposition 
when a,b,c E M. 

Example 3.5.9: Let h{x) := Then 



lim h{x) = 0. 
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Proof: The claim follows once we know 

lim —x^ + x = — oo 

and 

lim = 0, 

y—^—cx> 

which is usually proved when the exponential function is defined. 

3.5.4 Exercises 

Exercise 3.5.1: Prove Proposition 3.5.2. 

Exercise 3.5.2; Let f: [\,oo) ^M.be a function. Define g: (0, 1] — M via, g{x) = /(V-v)- Using the 
definitions of limits directly, show that \imx^Q+ g{x) exists if and only if\\mx^oof{x) exists, in which case 
they are equal. 

Exercise 3.5.3; Prove Proposition 3.5.8. 

Exercise 3.5.4; Let us justify terminology. Let f: M^Mbe a function such that Iimv^oo/(x) = oo (diverges 
to infinity). Show that f{x) diverges (i.e. does not converge) as x^ oo. 

Exercise 3.5.5; Come up with the definitions for limits of f{x) going to —°° as x ^ oo, x — )■ — oo, and as 
X —7- c for a finite c G M. Then state the definitions for limits of f{x) going to oo as x^ —oo, and as x^ cfor 
a finite c G M. 

Exercise 3.5.6: Suppose that P{x) ■.= x" + a„^ix"^^ -\ ^aix + ao is a monic polynomisi of degree n> 1 

(monic means that the coefficient ofx" is 1). a) Show that ifn is even then \imx^ooP{x) = Yircix-^-ooP{x) = oo. 
b) Show that ifn is odd then \imx^,^P{x) = oo and \\mx^-^P{x) = — oo (see previous exercise). 

Exercise 3.5.7: Let {x,,} be a sequence. Consider 5 : = N C M, and f:S^M. defined by f{n) := Xn. Show 
that the two notions of limit, 

lim Xn and lim f{x) 

are equivalent. That is, show that if one exists so does the other one, and in this case they are equal. 

Exercise 3.5.8; Extend Lemma 3.5.5 as follows. Suppose that 5 C M has a cluster point c G M, c = oo, or 
c = — oo. Let / : S — )• M Z?^ a function and let L = oo or L = — oo. Show that 

lim/(x) = L if and only if lim = L for all sequences {x„} such that lim x„ = c. 
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3.6 Monotone functions and continuity 

Note: 1 lecture ( optional, can safely be omitted unless §4.4 is also covered, requires §3.5) 



Definition 3.6.1. Let 5 C M. We say that / : 5 — )■ M is increasing (resp. strictly increasing) ifx,y&S 
with X <y implies f{x) < f{y) (resp. f{x) < f{y)). We define decreasing and strictly decreasing in 
the same way by switching the inequalities for /. 

If a function is either increasing or decreasing we say it is monotone. If it is strictly increasing 
or strictly decreasing we say it is strictly monotone. 

Sometimes nondecreasing (resp. nonincreasing) is used for increasing (resp. decreasing) func- 
tion to emphasize it is not strictly increasing (resp. strictly decreasing). 

3.6.1 Continuity of monotone functions 

It is easy to compute one-sided limits for monotone functions. 

Proposition 3.6.2. Let 5 C M, c G M, and f: 5 — M increasing. If c is a cluster point of 
5n (— c), then 

lim f{x) = sup{f{x) : X < c,x e S}, 
and ifc is a cluster point ofSCl (c, o°), then 

lim f{x) = inf{/(jc) : x > c,x E S}. 

Similarly, iff is decreasing and c is a cluster point ofS fl (— o°, c), then 

lim f{x) = mf{f{x) : x < c,x E S}, 

X— 7>C^ 

and ifc is a cluster point ofSH (c, °o), then 

lim f{x) = sup{f{x) : X > c^x e S}. 

x^c+ 

In particular all the one-sided limits exist whenever they make sense. 

Proof. Let us assume that / is increasing, and we will show the first equality. The rest of the proof 
is very similar and is left as an exercise. 

Let a := sup{/(x) : x < c,x G S}. If a = oo, then for every M G M, there exists an xm such that 
fi^Ai) > M and as / is increasing then f{x) > f{xM) > M for all x > xm- If we take 5 = c — we 
obtain the definition of the limit going to infinity. 

So assume a <o°. Let e > be given. Because a is the supremum, there exists an Xe < c,X£ E S, 
such that /(jce) > a — e. As / is increasing then if x G 5 and .^e < x < c we have a — £ < /(xe) < 
f{x) < a. Let 5 := c — X£. Then for x G 5n (— oo,c) with — c| < 5 we have \ f{x) — a| < £. □ 
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Suppose /: 5 — )■ M, c G 5 and that both one-sided limits exist. Since f{x) < f{c) < f{y) 
whenever x < c <y, taking the limits we obtain 

lim/W</(c)< Mm fix). 

Then of course / is continuous at c if and only if both limits are equal to each other (and hence 
equal to /(c)). See also Proposition 3.1.16. 

Corollary 3.6.3. If I C M is an interval and f: I ^Ris monotone, then f{I) is an interval if and 
only if f is continuous. 

Proof. If / is continuous then /(/) is an interval is a consequence of intermediate value theorem. 
See also Exercise 3.3.7. 

Let us prove the reverse direction by contrapositive. Suppose that / is not continuous at c G /, 
and that c is not an endpoint of /. Without loss of generality suppose that / is increasing. Let 

a :— lim f{x) = sup{/(x) : x G I,x < c}, b := lim f{x) = 'mf{f{x) : x G /,x > c}. 

As c is a discontinuity, then a < b. Any point in {a,b) \ {/(c)} is not in /(/). But if xi < c, then 
f{xi) < a, and if X2 > c, then f{x2) > b. Therefore, /(/) is not an interval. 

When c G / is an endpoint, the proof is similar and is left as an exercise. □ 

A striking property of monotone functions is that they cannot have too many discontinuities. 

Corollary 3.6.4. Let I GM.be an interval and / : / — t- M &c monotone. Then f has at most countably 
many discontinuities. 

Proof. Let £ C / be the set of all discontinuities that are not endpoints of /. As there are only 
two endpoints, it is enough to show that E is countable. Without loss of generality, suppose / is 
increasing. We will define an injection h: E ^ Q. For each c E E the one-sided limits of / both 
exist as c is not an endpoint. Let 

a := lim f{x) = sup{/(x) : x G fx < c}, b := lim f{x) = inf{/(x) : x G fx > c}. 

As c is a discontinuity, we have that a < b. There exists a rational number q G {a,b), so let 
h{c) := q. Because / is increasing, it is clear that q cannot correspond to any other discontinuity, 
so after making this choice for all c E E,we have that h is one-to-one (injective). Therefore, E is 
countable. □ 

Example 3.6.5: By [x\ denote the largest integer less than or equal to x. Define /: [0, 1] — )■ M by 

Li/(i-x)j 
f{x):=x+ £ 2-'\ 

for jc < 1 and /(I) =3. It is left as an exercise to show that / is strictly increasing, bounded, and has 
a discontinuity at all points 1 — i/fc for G N. In particular, there are countably many discontinuities, 
but the function is bounded and defined on a closed bounded interval. 
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3.6.2 Continuity of inverse functions 

A strictly monotone function / is obviously one-to-one (injective). Hence, it must have an inverse 
defined on its range. 

Proposition 3.6.6. If I C M. is an interval and f : I ^ M. is strictly monotone. Then the inverse 
f^^ : /(/) — > / is continuous. 

Proof. Let us suppose that / is strictly increasing. The proof is almost identical for a strictly 
decreasing function. 

Since / is strictly increasing, then so is /^^ Take c G /(/). If c is not a cluster point of /(/) 
then is continuous at c automatically. So let c be a cluster point of /(/). Suppose that both of 
the following one-sided limits exist: 

XQ := lim f^\y) = sup{/"^(3;) : y < c,y e f{I)} = sup{x : f{x) <c,xel}, 

Xl := lim f-\y) = mf{f-^{y) -.yyc.yE /(/)} = mf{x : f{x) > c,x E I}. 

For all x> xq with xEl,we have f(x) > c. As / is strictly increasing, we must have f{x) > c for 
all X > Xq, X E I. Therefore, all x > xq are candidates for the infimum xi. So xi = xq and is 
continuous at c. 

If one of the one-sided limits does not exist the argument is similar and is left as an exercise. □ 

Example 3.6.7: Notice that the proposition does not require / itself to be continuous. For example, 
let/: M^M 

X if jc < 0, 
x+\ ifx>0. 



The function / is not continuous at 0. The image of / = M is the set (— 0) U [1 , oo), no an interval. 
Then /^^ : (— oo,0) U [1,°°) — t- M can be written as 



X if X < 0, 
x—l if .Jc > 1. 



r\x)- 

It is not difficult to see that /~Ms a continuous function. 



3.6.3 Exercises 

Exercise 3.6.1: Suppose f: [0, 1] — )• M is monotone. Prove that f is bounded. 
Exercise 3.6.2; Finish the proof of Proposition 3.6.2. 
Exercise 3.6.3: Finish the proof of Corollary 3.6.3. 
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Exercise 3.6.4; Prove the claims in Example 3.6.5. 
Exercise 3.6.5: Finish the proof of Proposition 3.6.6. 

Exercise 3.6.6: Suppose S cM, and f : S ^ M is an increasing function, a) If c is a cluster point of SO {c,°°) 
show that lim f{x) < oo. b) If c is a cluster point ofSCi {—°°,c) and lim f{x) = 0°, prove that S C (— oo,c). 

Exercise 3.6.7: Suppose I cMis an interval and f: / — t- M m a function such that for each c £ I, there exist 
a,b gM with a > such that f{x) >ax + bfor allxGl and f{c) = ac + b. Show that f is strictly increasing. 

Exercise 3.6.8: Suppose f: I ^ J is a continuous, bijective (one-to-one and onto) function for two intervals 
I and J. Show that f is strictly monotone. 

Exercise 3.6.9: Given a monotone function f: / — ?■ M. Prove that there exists a function g: I such that 
lim g{x) = g{c) for all c £ I, except the smaller (left) endpoint of I, and such that g[x) = f{x) for all but 

countably many x. 

Exercise 3.6.10: a) Let 5 C M. If f : 5 — ?• M increasing, then show that there exists an increasing F : M — t- M 
such that f{x) = F(x) for all x £ S. b) Find an example of a strictly increasing f:S^M. such that an 
increasing F as above is never strictly increasing. 

Exercise 3.6.11 (Challenging): Find an example of an increasing function /: [0, 1] — J- M that has a disconti- 
nuity at each rational number Then show that the image /([0, 1]) contains no interval. Hint: Enumerate the 
rational numbers and define the function with a series. 



CHAPTER 3. CONTINUOUS FUNCTIONS 



Chapter 4 
The Derivative 



4.1 The derivative 

Note: 1 lecture 

The idea of a derivative is the following. Let us suppose that a graph of a function looks locally 
hke a straight line. We can then talk about the slope of this line. The slope tells us how fast is the 
value of the function changing at the particular point. Of course, we are leaving out any function 
that has comers or discontinuities. Let us be precise. 

4.1.1 Definition and basic properties 

Definition 4.1.1. Let / be an interval, let / : / — M be a function, and let c G /. If the limit 

L:=limMzM 

x^c X — C 

exists, then we say / is dijferentiable at c, that L is the derivative of / at c, and write f'{c) := L. 

If / is differentiable at all c G /, then we simply say that / is dijferentiable, and then we obtain a 
function/: / ^ M. 

The expression -^^^^J"^^^^ is called the difference quotient. 

The graphical interpretation of the derivative is depicted in Figure 4. 1 . The left-hand plot gives 
the line through (c,/(c)) and with slope ^"^"^ • When we take the limit as x goes to c, 

we get the right-hand plot, where we see that the derivative of the function at the point c is the slope 
of the line tangent to the graph of / at the point (c, /(c)) . 

Note that we allow / to be a closed interval and we allow c to be an endpoint of /. Some calculus 
books do not allow c to be an endpoint of an interval, but all the theory still works by allowing it, 
and it makes our work easier. 
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Example 4.1.2: Let f{x) := defined on the whole real line. We find that 

x^ — c^ (x + c)(x — c) , , 

km = km ^ ^ = lim(jc + c) = 2c. 

x-^c x — c x^c X — C x-^c 

Therefore /'(c) = 2c. 

Example 4.1.3: The function fix) := \x\ is not differentiable at the origin. When x > 0, then 

UI-IOI x-0 



jc-0 jc-0 

and when x < we have 



= 1, 



-x-0 _^ 



JC-0 x-0 

A famous example of Weierstrass shows that there exists a continuous function that is not 
differentiable at any point. The construction of this function is beyond the scope of this book. On 
the other hand, a differentiable function is always continuous. 

Proposition 4.1.4. Let f: I ^M.be differentiable at c E I, then it is continuous at c. 

Proof. We know that the limits 

lim ^^''^"•^^''^ = /' (c) and km (jc - c) = 

x-^c x — c x-^c 

exist. Furthermore, 
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Therefore the limit of f{x) — f{c) exists and 

lim(/W-/(c)) = (\\m^^^^^^^^) (lim(x-c)) =/(c)-0 = 0. 

Hence lim/(x) = /(c), and / is continuous at c. □ 

An important property of the derivative is linearity. The derivative is the approximation of 
a function by a straight line. The slope of a line through two points changes linearly when the 
^-coordinates are changed linearly. By taking the limit, it makes sense that the derivative is linear. 

Proposition 4.1.5. Let I be an interval, let f: / — M and g: I ^Rbe dijferentiable at c E I, and 
let a G M. 

(i) Define h: I ^'Rby h{x) := af{x). Then h is dijferentiable at c and h'{c) = ccf'{c). 

( ii) Define h: I ^Rby h{x) := f{x) +g{x). Then h is dijferentiable at c and h'{c) = f'{c) +g'{c). 

Proof. First, let h{x) = ccf{x). For x E I, x ^ c we have 

h{x)-h{c) af{x)-af{c) ^f{x)-f{c) 



The limit as x goes to c exists on the right by Corollary 3.1.12. We get 

x^c X — C X — C 

Therefore h is differentiable at c, and the derivative is computed as given. 
Next, define h{x) := f{x) -\-g{x). For xEl,x^cw& have 

h{x)-h{c) _ {f{x)+gix))-{f{c)+g{c)) _ f{x)-f{c) ^ g{x)-g{c) 

The limit as x goes to c exists on the right by Corollary 3.1.12. We get 

h{x)-h{c) f{x)-f{c) g{x)-g{c) 

hm — = hm — + lim * ^ ^ * ^ ^ 



x^c x — c Jf^f X — C X — C 

Therefore h is differentiable at c and the derivative is computed as given. □ 

It is not true that the derivative of a multiple of two functions is the multiple of the derivatives. 
Instead we get the so-called product rule or the Leibniz rule* . 



*Named for the German mathematician Gottfried Wilhelm Leibniz (1646-1716). 
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Proposition 4.1.6 (Product rule). Let I be an interval, let f: I ^M. and g: I ^Rbe functions 
dijferentiable at c. Ifh: I ^Ris defined by 

h{x) ■.= f{x)g{x), 

then h is differentiate at c and 

h\c)=f{c)g'{c)+f'{c)g{c). 

The proof of the product rule is left as an exercise. The key is to use the identity f{x)g{x) — 

f{c)g{c) = fix) {g{x) -g{c))+ g{c) (fix) - /(c)) . 

Proposition 4.1.7 (Quotient Rule). Let I be an interval, let f: / — > M and g: I^Wbe differentiable 
at c and g{x) ^ Ofor allxE I. Ifh: / — > R /s defined by 



then h is differentiable at c and 

Again the proof is left as an exercise. 
4.1.2 Chain rule 

A useful rule for computing derivatives is the chain rule. 

Proposition 4.1.8 (Chain Rule). Let Ii , I2 be intervals, let g: I\^hbe differentiable at c & h, and 
f:l2^M.be differentiable at g{c). Ifh : /i ^ M is defined by 

h{x) := {fog){x) = f{g{x)), 

then h is differentiable at c and 

h'{c)=f'{g{c))g'{c). 
Proof Let d := g(c). Define m : /2 ^ M and v : /i ^ M by 

\f'{d) if y = d, 
\g{c) if X = c. 
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We note that 

fiy)- f{d) = u{y){y-d) and g{x)- g{c) =v{x){x-c). 
We plug in to obtain 

h{x)-h{c) = f{gix)) -f{gic)) = u{gix)) {gix)-gic)) = u{gix)) {v{x)ix-c)). 
Therefore, 

^^^^i^=..few)vw. (4.1) 

x — c 

We compute the limits lim^^^M(y) = f'{d) = f'(g{c)) and limx^cv(-^) = s'i^). That is, the 
functions u and v are continuous at d = g{c) and c respectively. Furthermore the function g is 
continuous at c. Hence the limit of the right-hand side of (4.1) as x goes to c exists and is equal to 
/' [g{c))g'{c) . Thus h is differentiable at c and the limit is /' [g{c))g'{c) . □ 

4.1.3 Exercises 

Exercise 4.1.1; Prove the product rule. Hint: Use f{x)g{x) — f{c)g{c) = f{x) {g{x) —g{c)) +g{c) (/(x) — 

m). 

Exercise 4.1.2: Prove the quotient rule. Hint: You can do this directly, but it may be easier to find the 
derivative of ^/x and then use the chain rule and the product rule. 

Exercise 4.1.3; For n € Z, prove that x" is differentiable and find the derivative, unless, of course, n <0 and 
x = 0. Hint: Use the product rule. 

Exercise 4.1.4; Prove that a polynomial is differentiable and find the derivative. Hint: Use the previous 
exercise. 

Exercise 4.1.5; Let 

„ . jx^ ifx G Q, 
I otherwise. 

Prove that f is differentiable at 0, but discontinuous at all points except 0. 

Exercise 4.1.6; Assume the inequality \x — sin(x)| < x^. Prove that sin is differentiable at 0, and find the 
derivative at 0. 

Exercise 4.1.7; Using the previous exercise, prove that sin is differentiable at all x and that the derivative is 
cos(x). Hint: Use the sum-to-product trigonometric identity as we did before. 

Exercise 4.1.8; Let f: I ^Mbe differentiable. Given « G Z, define f" be the function defined by /"(x) := 
(/(x))". Ifn < assume f{x) / 0. Prove that (/")'(x) = «(/(x))"" VW- 
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Exercise 4.1.9; Suppose that f: W—^Wis a dijferentiable Lipschitz continuous function. Prove that f is a 
bounded function. 

Exercise 4.1.10; Let 1\ , I2 be intervals. Let / : 1\ I2 be a bijective function and g : I2 ^ h be the inverse. 
Suppose that both f is dijferentiable at c G h and f'{c) 7^ and g is differentiable at f{c). Use the chain 
rule to find a formula for g' {f{c)) ( in terms of f{c) ). 

Exercise 4.1.11; Suppose that f: I ^ Mis a bounded function and g: 7 — > R lv a function dijferentiable at 
c G I and g{c) = g'(c) = 0. Show that h{x) := f{x)g{x) is differentiable at c. Hint: Note that you cannot 
apply the product rule. 
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4.2 Mean value theorem 

Note: 2 lectures (some applications may be skipped) 

4.2.1 Relative minima and maxima 

We talked about absolute maxima and minima. These are the tallest peaks and lowest valleys in the 
whole mountain range. We might also want to talk about peaks of individual mountains and valleys. 

Definition 4.2.1. Let 5 C M be a set and let / : 5 — >■ M be a function. The function / is said to have 
a relative maximum at c G 5 if there exists a 5 > such that for sMx eS such that |jc — c| < 5 we 
have f{x) < f{c). The definition of relative minimum is analogous. 

Theorem 4.2.2. Let f: [a,Z?] — ?■ M be a function dijferentiable at c E {a,b), and c is a relative 
minimum or a relative maximum of f. Then f'{c) = 0. 

Proof. We prove the statement for a maximum. For a minimum the statement follows by considering 
the function — /. 

Let c be a relative maximum of /. In particular as long as \x — c\ < 5 we have f{x) — f{c) < 0. 
Then we look at the difference quotient. If .jc > c we note that 

/W^<o. 

x — c 

and if X < c we have 

/(^WM>o. 

x — c ~ 

We now take sequences {xn} and such that Xn > c, and yn < c for all n eN, and such that 
hm x„ = lim y„ = c. Since / is differentiable at c we know that 

> lim = /(c) = lim > 0. 

Xn — C n^°° yn — C 

□ 

4.2.2 RoUe's theorem 

Suppose that a function is zero at both endpoints of an interval. Intuitively it ought to attain a 
minimum or a maximum in the interior of the interval, then at such a minimum or a maximum, the 
derivative should be zero. See Figure 4.2 for the geometric idea. This is the content of the so-called 
RoUe's theorem. 

Theorem 4.2.3 (RoUe). Let f: [a,b] — )■ M continuous function differentiable on (a,b) such that 
f{a) = fib) = 0. Then there exists a c E {a,b) such that f'{c) = 0. 
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Figure 4.2: Point where tangent line is horizontal, that is /'(c) = 0. 



Proof. As / is continuous on [a,b] it attains an absolute minimum and an absolute maximum in 
[a,b]. If it attains an absolute maximum at c G {a,b), then c is also a relative maximum and we 
apply Theorem 4.2.2 to find that /'(c) = 0. If the absolute maximum is at a or at b, then we look for 
the absolute minimum. If the absolute minimum is at c G (a, b), then again we find that /'(c) = 0. 
So suppose that the absolute minimum is also at a or b. Hence the absolute minimum is and the 
absolute maximum is 0, and the function is identically zero. Thus f'{x) = for all x G [a, b], so pick 
an arbitrary c. □ 

4.2.3 Mean value theorem 

We extend RoUe's theorem to functions that attain different values at the endpoints. 

Theorem 4.2.4 (Mean value theorem). Let f: [a, b] —>M.be a continuous function differentiable on 
{a,b). Then there exists a point c G [a^b] such that 



Proof. The theorem follows from RoUe's theorem. Define the function g: [a, Z?] — M by 



The function ^ is a differentiable on {a,b), continuous on [a,b\, such that g{a) = and g{b) = 0. 
Thus there exists c G {a,b) such that g'{c) = 0. 



f{b)-f{a)=f{c){b-a). 



g{x) :=f(x)-f{b) + {f{b)-f{a 



)) 



b — x 



b — a 



= g'{c)=f{c) + {f{b)~f{a 



)) 



-1 



b — a 



Or in other words /'(c) {b — a)= f{b) — f{a) . 



□ 
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For a geometric interpretation of the mean value theorem, see Figure 4.3. The idea is that the 
value '^^'^lz{}"^ is the slope of the line between the points {a,f{d)) and (^b,f{b)). Then c is the 

point such that f'{c) = ^^^Izl'^"^ , that is, the tangent line at the point (c,/(c)) has the same slope 
as the line between (a, /(a)) and (^b,f{b)). 




Figure 4.3: Graphical interpretation of the mean value theorem. 



4.2.4 Applications 

We now solve our very first differential equation. 

Proposition 4.2.5. Let I be an interval and let f: I ^ be a dijferentiable function such that 
f'{x) = Ofor all X E L Then f is constant. 

Proof. Take arbitrary x,y El with x <y. Then / restricted to [x,y\ satisfies the hypotheses of the 
mean value theorem. Therefore there is a c G {x,y) such that 

f{y)-m=f'{c){y-x). 

as f\c) = 0, we have f{y) = f{x). Therefore, the function is constant. □ 

Now that we know what it means for the function to stay constant, let us look at increasing and 
decreasing functions. We say that /: / — )• M is increasing (resp. strictly increasing) ifx<y implies 
f{x) < f{y) (resp. f{x) < f{y)). We define decreasing and strictly decreasing in the same way by 
switching the inequalities for /. 

Proposition 4.2.6. Let f: I ^M.be a differentiable function. 

(i) f is increasing if and only if f\x) > Ofor all x E L 

( ii) f is decreasing if and only if f'{x) < Ofor all x EL 
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Proof. Let us prove the first item. Suppose that / is increasing, then for all x and c in / we have 

/(^WM>o. 

x — c ~ 

Taking a limit as x goes to c we see that f'{c) > 0. 

For the other direction, suppose that f'{x) > for all x G /. Let x <y ml. Then by the mean 
value theorem there is some c E {x,y) such that 

f{x)-f{y)=r{c){x-y). 

As /'(c) > 0, and x — y<0, then f{x) — f{y) < and so / is increasing. 

We leave the decreasing part to the reader as exercise. □ 

Example 4.2.7: We can make a similar but weaker statement about strictly increasing and decreas- 
ing functions. If f'{x) > for all x E I, then / is strictly increasing. The proof is left as an exercise. 
The converse is not true. For example, f{x) := jc-^ is a strictly increasing function, but /'(O) = 0. 

Another application of the mean value theorem is the following result about location of extrema. 
The theorem is stated for an absolute minimum and maximum, but the way it is applied to find 
relative minima and maxima is to restrict / to an interval (c — 5, c + 5). 

Proposition 4.2.8. Let f: {a,b) -^Rbe continuous. Let c E {a,b) and suppose f is dijferentiable 
on (a,c) and {c,b). 

( i) If f\x) < Ofor X E (a, c) and f'{x) > Ofor x E (c, b), then f has an absolute minimum at c. 

( ii) If f'{x) > Ofor X E (a, c) and f'{x) < Ofor x E (c, b), then f has an absolute maximum at c. 

Proof. Let us prove the first item. The second is left to the reader. Let x be in (a, c) and {j„} a 
sequence such that x<y„ <c and lim yn — c. By the previous proposition, the function is decreasing 
on (a, c) so f{x) > f{yn)- The function is continuous at c so we can take the limit to get f{x) > f{c) 
for all.x: E (a,c). 

Similarly take x E (c, b) and {yn} a sequence such that c <y„ < x and lim j„ = c. The function 
is increasing on {c,b) so f{x) > f{yn)- By continuity of / we get f{x) > f{c) for all x E {c,b). 
Thus f{x) > f{c) for sX\xE{a,b). □ 

The converse of the proposition does not hold. See Example 4.2.10 below. 

4.2.5 Continuity of derivatives and the intermediate value theorem 

Derivatives of functions satisfy an intermediate value property. The result is usually called Darboux's 
theorem. 

Theorem 4.2.9 (Darboux). Let f: [a,b\ — )■ M Z?e differentiable. Suppose that there exists ay eM. 
such that f\a) <y < fib) or f'{a) > y> fib). Then there exists a c E {a, b) such that f'{c) = y. 
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Proof. Suppose without loss of generality that f'{a) <y < f'{b). Define 

g(x) ■=yx-f(x). 

As g is continuous on [a,b], then g attains a maximum at some c e [a,b]. 

Now compute g'{x) = y — f'{x). Thus g'{a) > 0. As the derivative is the limit of difference 
quotients and is positive, there must be some difference quotient that is positive. That is, there must 
exist an X > a such that 

g{x)-g{a) 



X — a 



>0, 



or g{x) > g{a). Thus a cannot possibly be a maximum of g. Similarly as g'{b) < 0, we find anx<b 
(a different x) such that ^^^Iz.l^^'' < or that g{x) > g{b), thus b cannot possibly be a maximum. 
Therefore c e {a,b). Then as c is a maximimi of g we find g'{c) = and f'{c) =y. □ 

We have seen already that there exist discontinuous functions that have the intermediate value 
property. While it is hard to imagine at first, there also exist functions that are differentiable 
everywhere and the derivative is not continuous. 



Example 4.2.10: Let/: 



be the function defined by 

\ (xsin{yx)Y ifx^O, 
10 ifjc = 0. 

We claim that / is differentiable, but /' : R — > R is not continuous at the origin. Furthermore, / has 
a minimum at 0, but the derivative changes sign infinitely often near the origin. 

Proof: It is easy to see from the definition that / has an absolute minimum at 0: we know that 
f{x) > for all X and /(O) = 0. 

The function / is differentiable for x^O and the derivative is 2sin(i/x) (jcsin(i/x) cos(i/x)) . 
As an exercise show that for x„ = (g^^j-)^ we have lim = — 1, and for yn = ^g^^^^^ we have 
hm f'{yn) = 1- Hence if /' exists at 0, then it cannot be continuous. 

Let us show that /' exists at 0. We claim that the derivative is zero. In other words 
goes to zero as x goes to zero. For x 7^ we have 



m-m 



x-O 



-0 







x^sm^{i/x) 


x-O 




X 



xsm" 



'm\< 



\x\ 



And, of course, as x tends to zero, then |x| tends to zero and hence 
Therefore, / is differentiable at and the derivative at is 0. 



m-m 

x-O 



goes to zero. 



It is sometimes useful to assume that the derivative of a differentiable function is continuous. If 
/ : / — )> M is differentiable and the derivative /' is continuous on /, then we say that / is continuously 
differentiable. It is common to write (/) for the set of continuously differentiable functions on /. 
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4.2.6 Exercises 

Exercise 4.2.1: Finish the proof of Proposition 4.2.6. 
Exercise 4.2.2: Finish the proof of Proposition 4.2.8. 

Exercise 4.2.3: Suppose that f: M^Mis a differentiable function such that f is a bounded function. Then 
show that f is a Lipschitz continuous function. 

Exercise 4.2.4: Suppose that f: [a,b] ^ M is differentiable and c G [a^b]. Then show that there exists a 
sequence {x„} converging to c, x„ ^ cfor all n, such that 

/(c) = lim/(x„). 
Do note that this does not imply that f is continuous (why?). 

Exercise 4.2.5: Suppose that f: M — )• M a function such that |/(x) — f{y) \ < \x — y\^ for all x and y. Show 
that f[x) = Cfor some constant C. Hint: Show that f is differentiable at all points and compute the derivative. 

Exercise 4.2.6: Suppose that I is an interval and f: I ^Mis a differentiable function. If f'{x) > 0/or all 
x£ I, show that f is strictly increasing. 

Exercise 4.2.7: Suppose f: {a,b) -^M is a differentiable function such that f'{x) ^ Q for all x G {a,b). 
Suppose that there exists a point c ^ {a,b) such that f (c) > 0. Prove that f (x) >Oforallx£ {a,b). 

Exercise 4.2.8: Suppose that f: (a, Z?) — )• M and g: (a, ^) — >• M are differentiable functions such that f'{x) = 
g' (x) for all X €z {a,b), then show that there exists a constant C such that f{x) = g{x) +C. 

Exercise 4.2.9: Prove the following version of L'Hopital's rule. Suppose that f : (a,b) and g: {a,b) ^ 
M are differentiable functions. Suppose that at c ^ {ci,b), f{c) = 0, g{c) = 0, and that the limit of fi^)/g'(x) as 
X goes to c exists. Show that 

hm-— =lim— -. 

x^cg{x) ■^^cg'{x) 

Exercise 4.2.10: Let f: {a,b) -^M be an unbounded differentiable function. Show that f : {a^b) — t- M is 
unbounded. 
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4.3 Taylor's theorem 

Note: half a lecture (optional section) 



4.3.1 Derivatives of higher orders 

When / : / — )■ M is differentiable, we obtain a function /' : / — )■ M. The function /' is called the, first 
derivative of /. If /' is differentiable, we denote by /" : / — )■ M the derivative of The function /" 
is called the second derivative of /. We similarly obtain and so on. With a larger number 

of derivatives the notation would get out of hand; we denote by the nth derivative of /. 
When / possesses n derivatives, we say that / is n times dijferentiable. 



4.3.2 Taylor's theorem 

Taylor's theorem* is a generalization of the mean value theorem. It tells us that up to a small error, 
any n times differentiable function can be approximated at a point xq by a polynomial. The error of 
this approximation behaves like (x — xq)" near the point xq. To see why this is a good approximation 
notice that for a big n, (x — xq)" is very small in a small interval around xq. 

Definition 4.3.1. For an n times differentiable function / defined near a point xq E M, define the 
nth Taylor polynomial for / at xq as 

k=i 

^ f[xQ)+f [xq){x-xq)^ — (.^-.^o) H 7 — (^-^o) +---H j — (-^--^o) ■ 

z n\ 

Taylor's theorem says a function behaves like its nth Taylor polynomial. The mean value 
theorem is really Taylor's theorem for the first derivative. 

Theorem 4.3.2 (Taylor). Suppose f: [a, Z?] — ?■ M is a function with n continuous derivatives on [a, b] 
and such that /("+^) exists on {a,b). Given distinct points xq andx in [a,b], we can find a point c 
between xq and x such that 

f{x)=K'{x)+^-j^^{x-x,r+\ 

— xq)"^^ is called the remainder term. This form of the remainder 
term is called the Lagrange form of the remainder. There are other ways to write the remainder 
term, but we skip those. Note that c depends on both x and xq. 



*Named for the English mathematician Brook Taylor (1685-1731). It was first found by the Scottish mathematician 
James Gregory (1638 - 1675). The statement we give was proved by Joseph-Louis Lagrange (1736 - 1813) 
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Proof. Find a number Mx^xq (depending on x and xq) solving the equation 

f{x)=p:^^{x)+Mx,x,{x-xor+'. 

Define a function g{s) by 

g{s):=f{s)-P:^{s)-Mx^xo{s-xor+\ 

A simple computation shows that the ^th derivative of the Taylor polynomial (P^^)^'^^ (xq) = f'^^^ {xq) 
for A; = 0, 1,2, ... ,n (the zeroth derivative corresponds to the function itself). Therefore, 

g{xQ) = g'ixo) = g"{xo) = ■■■= ^(")(xo) = 0. 

In particular g{xo) = 0. On the other hand g{x) = 0. By the mean value theorem there exists an xi 
between xq and x such that g'{xi) = 0. Applying the mean value theorem to g' we obtain that there 
exists X2 between xq and xi (and therefore between xq and x) such that g"{x2) = 0. We repeat the 
argument n + l times to obtain a number Xn+i between xq and Xn (and therefore between xq and x) 
such that^("+i)(jc„+i) =0. 

Let c :=Xn+i. We compute the (n + l)th derivative of g to find 

=/(«+!)(,) -(n + l)!M.,.„. 
Plugging in c for s we obtain that M^^xo = , and we are done. □ 

Li the proof we have computed that (1^°)^^^ (xq) = /*^^) (xq) for = 0, 1 , 2, . . . , n. Therefore the 
Taylor polynomial has the same derivatives as / at xq up to the nth derivative. That is why the 
Taylor polynomial is a good approximation to /. 

In simple terms, a differentiable function is locally approximated by a line, that's the definition 
of the derivative. Similarly we mention in passing that there exists a converse to Taylor's theorem, 
saying that if a function is locally approximated in a certain way by a polynomial of degree d, then 
it has d derivatives. 



4.3.3 Exercises 

Exercise 4.3.1: Compute the nth Taylor Polynomial at Ofor the exponential function. 

Exercise 4.3.2; Suppose that p is a polynomial of degree d. Given any xq G R, show that the [d + \)th Taylor 
polynomial for p at xq is equal to p. 

Exercise 4.3.3: Let f{x) := Compute f'{x) and f" [x) for all x, but show that f^^\0) does not exist. 

Exercise 4.3.4; Suppose / : R — t- M has n continuous derivatives. Show that for any xq G M, there exist 
polynomials P and Q of degree n and an £ > such that P{x) < f[x) < Q{x)for all x £ [xq — £,xo + s] and 
Q{x) — P(x) = X{x — X{))" for some X >0. 
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Exercise 4.3.5: Iff: [a,b] — > M has n+ 1 continuous derivatives andxQ G [a,b\, prove lim — = 0. 

Exercise 4.3.6; Suppose that /: [fl;,Zj] — ^ M has n + \ continuous derivatives and xq G {a,b). Show that 
f^^^xo) = for all k = 0,1,2, ... ,n if and only if g{x) := '^^^}„+i is continuous at xq. 
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4.4 Inverse function theorem 

Note: less than 1 lecture (optional section, needed for §5.4, requires §3.6) 
4.4.1 Inverse function theorem 

The main idea of differentiating inverse functions is the following lemma. 

Lemma 4.4.1. Let I,J dM. be intervals. If f: I ^ J is a strictly monotone (hence one-to-one) 
function that is continuously differentiable, onto (f{I) = J), and f is never zero, then the inverse 
f^ ^ is also continuously differentiable and 

{f-^y{y)= \ for ally e J. 

Proof. By Proposition 3.6.6 we have that / has a continuous inverse. Let us call the inverse g: J I 
for convenience. For s,y E J find t,x E I such that f{t) = s and f{x) = y. Then 

g{s)-g{y) g{fit))-g{fix)) t-x 



s-y m-f{x) fit) -fix)- 

Next note that t = g{s), x = g{y), g is continuous, / is differentiable at x, and f'{x) ^ 0. Therefore 



« s-y /'(j;) f{g(y))- 

As both /' and g are continuous, then g' is also continuous. □ 

What is usually called the inverse function theorem is the following result. 

Theorem 4.4.2 (Inverse function theorem). Let f: {a,b) be a continuously differentiable 
function, xq E (a, b) a point where f{xo) 7^ 0. Then there exists an interval I C (a, b) with xq E I, 
the restriction /|/ is injective with an inverse g: J ^ I defined on J := f{I), which is continuously 
differentiable and 

g'{y) = r,/ , for ally E J. 

Proof. Without loss of generality, suppose that f'{xo) > 0. As /' is continuous, there must exist an 
interval / with xq E I such that f'(x) > for all xq E I. 

By Exercise 4.2.6 we have that / is strictly increasing on / and hence the restriction f\j bijective 
onto J := f{I). As / is continuous, then by the intermediate value theorem (see also Corollary 3.6.3), 
/(/) is in interval. Now apply Lemma 4.4.1. □ 
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If you have tried to prove the existence of roots directly as in Example 1.2.3 you may have seen 
how difficult that is. However, with the machinery we have built for inverse functions it becomes 
almost a trivial exercise, and with and the inverse function theorem we prove far more than mere 
existence. 

Corollary 4.4.3. For any n G N and any x>0 there exists a unique positive number y ( denoted 
^i/n ■=y)^ such that y" = x. Furthermore, the function g : (0,oo) — )■ (0,oo) defined by g{x) :=x^^" 
is continuously differentiable and 



1 1 



_^{l-n)/n 



using the convention x"^"^ := (x^/" 



Proof. For x = Q the existence of a unique root is trivial. 

Let f{x) := x". We have seen / is continuously differentiable and f'{x) = nx"^K For x> the 
derivative /' is strictly positive and so again by Exercise 4.2.6, / is strictly increasing (this can also 
be proved directly). It is also easy to see that the image of / is the entire interval (0,oo). We obtain 
a unique inverse g and so the existence and uniqueness of positive nth roots. We apply Lemma 4.4.1 
to obtain the derivative. □ 

Example 4.4.4: The corollary provides a good example of where the inverse function theorem only 
gives us a smaller interval. Take f{x) := x^. Then f'{x) 7^ as long as .x; 7^ 0. If xq > 0, we can take 
/ = (0,00), but no larger. 

Example 4.4.5: Another useful example is f{x) :=x^. The function /: M — )• M is one-to-one and 
onto, so f^^{x) = x^^^ exists on the entire real line including zero and negative x. The function / 
has a continuous derivative, but has no derivative at the origin. The point is that /'(O) = 0. See 
also Exercise 4.4.4. 



4.4.2 Exercises 

Exercise 4.4.1; Suppose that f: M — ?• R continuously differentiable such that f'{x) > Ofor all x. Show 
that f is invertible on the interval J = /(M), the inverse is continuously differentiable, and (/^^) {y) > Ofor 
allyGf{R). 

Exercise 4.4.2; Prove the following version of the inverse function theorem: Let fJcMbe intervals. If 
/: I ^ J is strictly monotone (hence one-to-one) and onto. Suppose that f is differentiable at xq and 
f'{xQ) / 0. Then prove that the inverse f^^ is differentiable atyo = /'(jco) <^nd (f^^Yiyo) = ttAtt- 



Exercise 4.4.3; Let n be even. Prove that every x > has a unique negative nth root, that is a negative 
number y such thaty" = x. Compute the derivative of the function g{x) := y. 
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Exercise 4.4.4; Let n be odd and n>3. Prove that every x has a unique nth root, that is a number y 
such that y" = x. Prove that the function defined by g{x) :=y is dijferentiable except atx = and compute 
the derivative. Prove that g is not dijferentiable at x = 0. 

Exercise 4.4.5 (requires §4.3); Show that if in the inverse function theorem f has k continuous derivatives, 
then the inverse function g also has k continuous derivatives. 

Exercise 4.4.6; Let f{x) := x + 2x^ sin(i/A) for x 7^ and /(O) = 0. Show that f is differentiable at all x, 
that /'(O) > 0, but that f is not invertible on any interval containing the origin. 

Exercise 4.4.7: a ) Let f: M — )• M fte a continuously dijferentiable function and k > be a number such that 
f'{x) > kfor all x € M. Show that f is one-to-one and onto, and has a continuously differentiable inverse 
: M — )■ M. b) Find an example / : M — >• M where f'{x) > Ofor all x, but f is not onto. 



Chapter 5 



The Riemann Integral 



5.1 The Riemann integral 

Note: 1.5 lectures 

We now get to the fundamental concept of integration. There is often confusion among students 
of calculus between integral and antiderivative. The integral is (informally) the area under the 
curve, nothing else. That we can compute an antiderivative using the integral is a nontrivial result 
we have to prove. In this chapter we define the Riemann integral* using the Darboux integral^, 
which is technically simpler than (but equivalent to) the traditional definition as done by Riemann. 



5.1.1 Partitions and lower and upper integrals 

We want to integrate a bounded function defined on an interval [a,b]. We first define two auxiliary 
integrals that can be defined for all bounded functions. Only then can we talk about the Riemann 
integral and the Riemann integrable functions. 

Definition 5.1.1. A partition P of the interval [a,b] is a finite set of numbers {xo,xi,X2: ■ ■ ■ :X„} 
such that 

a = Xq < Xl < X2 < ■ ■ ■ < Xn- \ <Xn = b. 

We write 

AjC/ . Xi ^i— 1 • 



*Named after the German mathematician Georg Friedrich Bernhard Riemann (1826-1866). 
^^Named after the French mathematician Jean-Gaston Darboux (1842-1917). 
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Let / : [a, Z?] — 7- M be a bounded function. Let P be a partition of [a, b] . Define 

mi := inf{/(x) : Xj-i <x< Xi}, 
Mi := sup{/(x) : jc,-i <x < x,-}, 

n 

HP J) ■= E"^/^/, 

i=l 

U{P,f):=f^M,Ax,. 

1=1 

We call L{P,f) the /ower Darboux sum and f/ (^,/) the upper Darboux sum. 

The geometric idea of Darboux sums is indicated in Figure 5.1. The lower sum is the area of 
the shaded rectangles, and the upper sum is the area of the entire rectangles. The width of the rth 
rectangle is Ax,-, the height of the shaded rectangle is m, and the height of the entire rectangle is Mi. 




Figure 5.1: Sample Darboux sums. 



Proposition 5.1.2. Let f: [a, Z?] — t- M Z^e a bounded function. Let m^M EM.be such that for all x we 
have m < f{x) < M. For any partition P of [a, b] we have 

m{b-a)<L{PJ) < U{PJ) <M{b-a). (5.1) 

Proof. Let P be a partition. Then note that m < m, for all / and M, < M for all /. Also m,- < for 
all i. Finally Y!i^\ = (b — a). Therefore, 

(n \ n n 

^ Ax/ I = ^ mAx/ < ^ miAxi < 
i=l J i=l i=l 

n n / " \ 

< J^M/Ajc/ < J^MAx,- =M J^Ax/ =M{b-a). 

i=l i=l \i=\ J 
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Hence we get (5.1) . In other words, the set of lower and upper sums are bounded sets. □ 
Definition 5.1.3. As the sets of lower and upper Darboux sums are bounded, we define 

sup{L(/', f):Pa partition of [a, Z?] }, 

mf{U {PJ) : P a partition of [a, b]]. 

We call / the lower Darboux integral and / the upper Darboux integral. To avoid worrying about 
the variable of integration, we often simply write 

rb rb rb rb 

f-= f{x)dx and I f '■= I f{^)dx. 

J a J a J a J a 

If integration is to make sense, then the lower and upper Darboux integrals should be the same 
number, as we want a single number to call the integral. However, these two integrals may in fact 
differ for some functions. 

Example 5.1.4: Take the Dirichlet function /: [0, 1] M, where f{x) := 1 if x G Q and f{x) := 
if jc ^ Q. Then 

/V = and /V=l- 

70 JO 

The reason is that for every i we have that m, = inf{/(jc) : x E [jc,_i,x/]} = and Mj = sup{/(.x:) : 
X G [x/_i,;ic,]} = 1. Thus 

L(/',/) = £0-A^, = 0, 

t/(/',/) = £l-M = £A^, = l. 

Remark 5.1.5. The same definition of J^f and /*/ is used when / is defined on a larger set S 
such that [a, b] C S. In that case, we use the restriction of / to [a, b] and we must ensure that the 
restriction is bounded on [a,b]. 

To compute the integral we often take a partition P and make it finer. That is, we cut intervals in 
the partition into yet smaller pieces. 

Definition 5.1.6. Let P = {xq^xi,. . . ,jc„} and P = {xq.xi,. . . ,Xm} be partitions of [a,b]. We say P 
is a refinement of P if as sets P C P. 
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That is, P is a refinement of a partition if it contains all the numbers in P and perhaps some other 
numbers in between. For example, {0, 0.5, 1 , 2} is a partition of [0, 2] and {0, 0.2, 0.5, 1 , 1 .5, 1 .75, 2} 
is a refinement. The main reason for introducing refinements is the following proposition. 

Proposition 5.1.7. Let f: [a,b] ^M. be a bounded function, and let P be a partition of [a, b]. Let P 

be a refinement of P. Then 

L{PJ)<L{PJ) and U{PJ) < U{PJ). 

Proof. The tricky part of this proof is to get the notation correct. Let P := {xq.xi,. . . ,x,n} be 
a refinement of P .= {xq.xi, ... ,x„}. Then xq = xq and x„ = Xm- In fact, we can find integers 
^0 < ^1 < ■ ■ ■ < such that jc/ = i^. for / = 0, 1,2, . . . ,n. 
Let Axj = Xj-i —Xj. We get that 

Axj = ^p- 

P=kj-i + l 

Let nij be as before and correspond to the partition P. Let mj := inf{/(x) : Xj-\ <x< Xj}. Now, 
tnj < nip for kj^i < p <kj. Therefore, 

kj kj kj 

nijAxj = nij ^ Axp = ^ mjAxp < ^ nip Ax p. 

p—kj_i + \ p=kj_i + l p=kj_i + l 



So 



L(P^ /) = E mjAxj < £ £ fhpAxp = £ mjAxj = L{P, f) . 

j=l j^ip=kj^y + l j=l 

The proof of U {P, f) <U {P, f) is left as an exercise. □ 

Armed with refinements we prove the following. The key point of this next proposition is that 
the lower Darboux integral is less than or equal to the upper Darboux integral. 

Proposition 5.1.8. Let f: [a, Z?] — t- M &e a bounded function. Let m,M eM. be such that for all x we 
have m< f{x) <M. Then 

m{b-a)< f f < f f <M{b-a). (5.2) 

J a J a 

Proof. By Proposition 5. 1 .2 we have for any partition P 

m{b-a)<L{PJ) < U{PJ) <M{b-a). 
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The inequality m{b — a) < L{P,f) implies m{b — a) < j^f. Also U{P,f) < M{b — a) implies 
~]!f<M{b-a). 

The key point of this proposition is the middle inequality in (5.2) . Let Pi , P2 be partitions of [a,b\. 
Define P .= P1UP2. The set P is a partition of [a,b]. Furthermore, P is a refinement of Pi and it is 
also a refinement of /'2- By Proposition 5.1.7 we have L(A,/) <L{PJ) and U{P J) <U{P2j). 
Putting it all together we have 

L{PiJ) < L{P,f) < U{PJ) < U{P2,f). 

In other words, for two arbitrary partitions Pi and P2 we have L(Pi,f) < U{P2,f). Now we recall 
Proposition 1.2.7. Taking the supremum and infimum over all partitions we get 

sup{L{PJ) : P a partition} < mf{U{P,f) : P a partition}. 
In other words /,V</,V. □ 



5.1.2 Riemann integral 

We can finally define the Riemann integral. However, the Riemann integral is only defined on a 
certain class of functions, called the Riemann integrable functions. 

Definition 5.1.9. Let / : [a, Z?] — t- M be a bounded function such that 

rb rb 

/ f{x) dx= fix) dx. 

J a J a 

Then / is said to be Riemann integrable. The set of Riemann integrable functions on [a, b] is denoted 
by 3S[a, b] . When / G .^[a, Z?] we define 



rb rb rb 

/ f{x)dx:= / f{x)dx= / f{x)dx. 

J a J a J a 



As before, we often simply write 



rb rb 

/ /:= / fix)dx. 

J a J a 



The number /*/ is called the Riemann integral of /, or sometimes simply the integral of /. 

By definition, any Riemann integrable function is bounded. By appealing to Proposition 5.1.8 
we immediately obtain the following proposition. 

Proposition 5.1.10. Let f: [a,b] ^M.be a Riemann integrable function. Let m,M be such that 
m < fix) < M. Then 



mib-a) < [ f<Mib-a). 

J a 
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Often we use a weaker form of this proposition. That is, if \f{x) \ < M for all x E [a,b], then 



J a 



<M{b-a) 



Example 5.1.11: We integrate constant functions using Proposition 5.1.8. If f{x) := c for some 
constant c, then we take m = M = c. In inequality (5.2) all the inequahties must be equalities. Thus 
/ is integrable on [a, b] and f = c{b — a). 

Example 5.1.12: Let /: [0, 2] ^ M be defined by 







if jc< 1, 


m := < 


r 


\fx= 1 , 






\fx> \. 



We claim that / is Riemann integrable and that Jq f = 1 . 

Proof: Let < £ < 1 be arbitrary. Let P := {0, 1 — £, 1 + £, 2} be a partition. We use the notation 
from the definition of the Darboux sums. Then 



mi = inf{/(jc) : X G [0, 1 - £] } = 1 , Mi = sup{/(jc) : ;c G [0, 1 - £] } = 1 , 

m2 = inf{/(jc) : X G [1 - £, 1 + £] } = 0, M2 = sup{/(jc) : x e [1-£,1+£]} = 1, 
m3=mf{f{x):xe [l+£,2]} = 0, M3 = sup{/(jc) : x G [l + £,2]} = 0. 

Furthermore, Axi = 1 — £, Ax2 = 2£ and Ax^ = 1 — £. We compute 

3 

^P,/) = £m,Ajc, = l-(l-£)+0-2£ + 0-(l-£) = l-£, 
i=l 
3 

U{PJ) = £m,Aa;/ = l-(l-£) + l-2£ + 0-(l-£) = l+£. 
Thus, 

2 2 

//-/ f<U{PJ)-L{PJ) = {l + e)-{\-e)=2e. 
Jo Jo_ 

By Proposition 5.1.8 we have Jq / < Jq/. As £ was arbitrary we see that Jq / = Jq/. So / is 
Riemann mtegrable. Finally, 

l-£ = L(P,/)< f<U{PJ) = \+e. 
Jo 

I 2 I 2 

Hence, Jq / — 1 < £. As £ was arbitrary, we have that Jq / = 1 . 
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It may be worthwhile to sum up part of the technique of the example in a proposition. 

Proposition 5.1.13. Let f: [a,b] -^Mbe a bounded function. Then f is Riemann integrable if for 
every e > 0, there exists a partition P such that 

V{PJ)-L{PJ)<e. 

Proof. If for every e > 0, a P exists we have: 

~rb 



0< [ f- [ f<U{PJ)-L{PJ)<e. 

J a J a 

Therefore, J^f = J^f, and / is integrable. □ 

Example 5.1.14: Let us show that is integrable on [0,b] for any ^ > 0. We will see later that 
all continuous functions are integrable, but let us demonstrate how we can do it directly. 

Let e > be given. Take n e N and pick xj :— ib/n, to form the partition P := {xq^xi , . . . ,jc„} of 
[0,^]. We have Axj = b/„ for all j. For for any subinterval [xy_i,Xy] we obtain 

"^J = I TT~ '■ ^ ^ [^j-hXj] I = 77—, Mj = sup I : x e [xj^i,Xj] I = — . 

1^1 +JC J l+^y 1^1 +JC J Xj-i 

Then we have 

U{P,f) - L{PJ) = Axj f {Mj - mj) = 

7=1 



i-i)b/n l+ib/nj n\l+Ob/n l+nb/„J n{b+l)' 

The sum telescopes, the terms successively cancel each other, something we have seen before. 
Picking n to be such that < £ the proposition is satisfied and the function is integrable. 



5.1.3 More notation 

When / : 5 — > R is defined on a larger set S and [a,b] C S, we say that / is Riemann integrable on 
[a^b] if the restriction of / to [a,b] is Riemann integrable. In this case, we say / e ^[a,b], and we 
write /^f / to mean the Riemann integral of the restriction of / to [a,b]. 

It is useful to define the integral /*/ even if a ^b. Suppose that b <a and that / e ^[b,a], 
then define 
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For any function / we define 

r/:=o. 

J a 



At times, the variable x may already have some other meaning. When we need to write down 
the variable of integration, we may simply use a different letter. For example. 



pb rb 

j f{s) ds:= f{x) dx. 

J a J a 



5.1.4 Exercises 



Exercise S.l.l: Letf: \] ^ be defined by f{x) •.=x^ and let P := {0,0.1, OA, \}. Compute L{P,f) and 
U{P,f). 

Exercise 5.1.2; Let f: [0, 1] — )• M Z?e defined by f{x) := x. Show that f £ M[0,l] and compute Jq f using 
the definition of the integral (but feel fi-ee to use the propositions of this section). 

Exercise 5.1.3: Let f: [a,b] be a bounded function. Suppose that there exists a sequence of partitions 
{Pk} of[a,b] such that 

\im{U{PkJ)-L{PuJ)) =0. 
Show that f is Riemann integrable and that 

t f= lim U{PkJ) = lim L(P,,/). 

Exercise 5.1.4; Finish the proof of Proposition 5.1.7. 
Exercise 5.1.5: Suppose that / : [— 1 , 1] — )■ M « defined as 



1 ifx > 0, 
ifx < 0. 



Prove that f £ =^[—1,1] and compute /ij/ using the definition of the integral (but feel free to use the 
propositions of this section). 



Exercise 5.1.6: Let c G {a,b) and let J G M. Define f: [a,b\ -^M.as 



d ifx = c, 
ifx / c. 



Prove that f £^[a,b] and compute / using the definition of the integral ( but feel free to use the propositions 
of this section). 
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Exercise 5.1.7; Suppose that f: [a,b] -^M is Riemann integrable. Let e > be given. Then show that 
there exists a partition P = {xq,x\,. . . ,Xn} such that if we pick any set of numbers {ci,C2, . . . ,c„} with 
Ck G [xk-\-,Xk]for all k, then 



Exercise 5.1.8: Let f: [a,b] ^ be a Riemann integrable function. Let a > and jS G M. Then define 
g{x) := f{ax + j5) on the interval I = [^/a{a — j5),^/a{b — jS)]. Show that g is Riemann integrable on L 

Exercise 5.1.9; Let f: [a,b] ^ be a Riemann integrable function. Let a > and j3 G M. Then define 
g{x) := f{ax + P) on the interval I = [i/a(a — l5),^/a{b — P)]. Show that g is Riemann integrable on L 

Exercise 5.1.10; Let f: [0,1] be a bounded function. Let P„ = {xq,xi ,x„} be a uniform partition of 
[0, 1], that is Xj := J/n. Is {L(P„,/)}~^j always monotone? Yes/No: Prove or find a counterexample. 

Exercise 5.1.11; For a bounded function f: [0, 1] — t- M let Rn '■= Y!]=i /(V") ( uniform right hand 
rule), a) If f is Riemann integrable show Jq f = lim/?„. b) Find an f that is not Riemann integrable, but 
Um R„ exists. 
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5.2 Properties of the integral 

Note: 2 lectures, integrability of functions with discontinuities can safely be skipped 
5.2.1 Additivity 

The next result we prove is usually referred to as the additive property of the integral. First we prove 
the additivity property for the lower and upper Darboux integrals. 

Lemma 5.2.1. Suppose a <b < c and f: [a, c] — t- M is a bounded function. Then 



and 




Proof. If we have partitions Pi = {xo,^i, . . . ,Xk} of [a,b] and/'2 = {^k^^k+ii- ■ ■ ^^n} of [b,c], then 
the set P := Pi[JP2 = {xq.xi,. . . ,Xn} is a partition of [a,c]. Then 

n k n 

L{P. /) = E mjAxj = mjAxj + £ nijAxj = L{Pi , /) + L(P2, /) • 

When we take the supremum of the right hand side over all Pi and Pz, we are taking a supremum 
of the left hand side over all partitions P of [a, c] that contain b. If Q is any partition of [a, c] and 
P — {b}, then P is a refinement of Q and so L{Q, f) < L{P, f) . Therefore, taking a supremum 
only over the P that contain b is sufficient to find the supremum of L{P,f) over all partitions P, see 
Exercise 1.1.9. Finally recall Exercise 1.2.9 to compute 

sup{L(/', f):Pa partition of [a, c] } 
sup{L{P,f) : P a partition of [a, c],b e P} 

sup{L(/'i , /) + L{P2,f) : Pi a partition of [a, ^] , ^2 a partition of [b, c] } 
sup{L(i'i , /) : Pi a partition of [a, b]} -\- sup{L(P2, /) : Pi a partition of [b, c] } 

la lb ^ 
Similarly, for P, Pi , and P2 as above we obtain 

U{P,f) = f^MjAxj = Y.MjAxj+ £ MjAxj = U{Pi,f)+U{P2,f). 
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We wish to take the infimum on the right over all Pi and P2, and so we are taking the infimum 
over all partitions P of [a, c] that contain b. If Q is any partition of [a, c] and P = QU {b}, then P 
is a refinement of 2 and so U (2,/) > U (/*,/). Therefore, taking an infimum only over the P that 
contain b is sufficient to find the infimum of U (P, /) for all P. We obtain 

fC fb fC 

f= f+ f- 
Ja Ja Jb 

□ 

Theorem 5.2.2. Let a < b < c. A function f: [a,c] — )■ M is Riemann integrable if and only iff is 
Riemann integrable on [a,b\ and \b.,c\. If f is Riemann integrable, then 

Ja Ja Jb 

Proof. Suppose that / G c], then f^f — J^f — J^f. We apply the lemma to get 

PC PC pb PC rb PC pc pc 

f=f=f+f<f+f-f-f- 
Ja Ja Ja Jb Ja Jb Ja Ja 

Thus the inequality is an equality and 

J a Jb Ja Jb 

As we also know that J^f < J^f and J^f < J^f, we conclude that 

pb pb PC PC 

f=f and / /= / /• 

Ja Ja Jb Jb 

Thus / is Riemann integrable on [a,b] and [b,c] and the desired formula holds. 

Now assume that the restrictions of / to [a, b] and to [b, c] are Riemann integrable. We again 
apply the lemma to get 

PC pb PC pb PC pb PC PC 

f+ f+ f= f+ f= f- 

Ja Ja Jb Ja Jb Ja Jb Ja 

Therefore / is Riemann integrable on [a, c], and the integral is computed as indicated. □ 

An easy consequence of the additivity is the following corollary. We leave the details to the 
reader as an exercise. 

Corollary 5.2.3. Iff G ^[a,b] and [c,d] C [a,b], then the restriction f\[c4] in ^[c,d]. 
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5.2.2 Linearity and monotonicity 

Proposition 5.2.4 (Linearity). Let f and g be in ^[a,b] and a e R. 

( i) af is in M[a,b] and 

rb rb 

/ af{x) dx^ a f{x) dx. 

J a J a 

( ii) f + g is in ^[a,b] and 

rb rb 



/ f{x)+g{x)dx= f{x)dx+ g{x)dx. 

Ja Ja Ja 



Proof. Let us prove the first item. First suppose that a > 0. For a partition P we notice that (details 
are left to the reader) 

L{P, af) = aL{P,f) and U{P, af) = aU{P,f). 

For a bounded set of real numbers we can move multiplication by a positive number a past the 
supremum. Hence, 

rb 

/ a f{x) dx = sup{L(P, af) -.P a partition} 

Ja 

= sup{aL(P,/) : P a partition} 

= asup{L(P,/) : P a partition} 

l-b 

= a f{x) dx. 

Ja 

Similarly we show that 

rb rb 

/ af{x) dx= a f{x) dx. 

Ja Ja 

The conclusion now follows for a >0. 

To finish the proof of the first item, we need to show that —f{x) dx= — f{x) dx. The 
proof of this fact is left as an exercise. 

The proof of the second item in the proposition is also left as an exercise. It is not as trivial as it 
may appear at first glance. □ 

Proposition 5.2.5 (Monotonicity). Let f and g be in ^[a, b] and let f{x) < g{x) for all x E [a, b]. 
Then 

rb rb 



Ja Ja 
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Proof. Let P = {xq.xi , . . . , } be a partition of [a,b]. Then let 



nii := inf{/(jc) : x G 



and 



m := inf{^(x) : .x: G [jc/_i,x,]}. 



As /(.x:) < g{x), then < m,. Therefore, 



n n 



L{PJ) = Y,m,Axi < Y,m,Axi = L{P,g). 



(=1 (=1 



We take the supremum over all P (see Proposition 1.3.7) to obtain that 



rh rb 

r-h 

J a J a 



As / and g are Riemann integrable, the conclusion follows. 



□ 



5.2.3 Continuous functions 

We say that a function /: — )■ M has finitely many discontinuities if there exists a finite set 
S := {xi,X2, . . . ,Xn} C [a,b], and / is continuous at all points of [a,b] \ S. Before we prove that 
bounded functions with finitely many discontinuities are Riemann integrable, we need some lemmas. 
The first lemma says that bounded continuous functions are Riemann integrable. 

Lemma 5.2.6. Let f: [a,b] -^M. be a continuous function. Then f G ^[a^b]. 

Proof. As / is continuous on a closed bounded interval, it is uniformly continuous. Let £ > be 
given. Find a 5 > such that \x — y\<5 implies \ f{x) — f{y) \ < 

Let P — {xQ.xi,. . . ,Xn} be a partition of [a,b] such that Axi < 5 for all i = 1, 2, . . . ,n. For 
example, take n such that < 5 and let Xi := ^{b — a) + a. Then for all x,y & we have 

that 1^ — 3^1 < Axi < 5 and so 



As / is continuous on [x,- it attains a maximum and a minimum on this interval. Let x be a 
point where / attains the maximum and y be a point where / attains the minimum. Then f{x) = Mi 
and f{y) = mi in the notation from the definition of the integral. Therefore, 



m-f{y)<\f{x)-f{y)\< 



e 



b — a 



Mi-mi = f{x)-f{y) < 



£ 



b — a 
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And so 

/'/- tf<U{PJ)-L{PJ) 

J a J a 



n 

= Y,{Mi-mi)Axi 
<T. H^' 

£ 

(b-a) = £. 



b — a 



As £ > was arbitrary, 



h 

f= I /, 



and / is Riemann integrable on [a, Z?] . □ 

The second lemma says that we need the function to only be "Riemann integrable inside the 
interval," as long as it is bounded. It also tells us how to compute the integral. 

Lemma 5.2.7. Let f: [a,b\ — t- M Z?e a bounded function that is Riemann integrable on [a' ,b'] for 
all a' , b' such that a < a' <b' <b. Then f E ^[a,b]. Furthermore, if a < an < b„ < b are such that 
Um a„ = a and lim bn = b, then 



fb ft>n 

/ /=lim / /. 

J a Ja„ 



Proof. Let M > be a real number such that \ f{x) \ < M. Pick two sequences of numbers a < a„ < 
b„ < b such that lim an = a and lim bn = b. Note that M > and {b — a) > {b„ — an). We thus have 



-M{b-a) < -M{bn-an) < f " f <M{bn-an) <M{b-a). 

Ja„ 

Thus the sequence of numbers is bounded and by Bolzano- Weierstrass has a convergent 

" B 

subsequence indexed by nj^. Let us call L the limit of the subsequence {/a,'^'' 

Lemma 5.2.1 says that the lower and upper integral are additive and the hypothesis says that / 

is integrable on [a„^,Z7„^]. Therefore 



f= f+ f+ f>-M{an,-a)+ f-M{b-bn,j. 

Ja Ja J cin, •>b„, Jdn, 
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We take the limit as k goes to oo on the right-hand side, 




/> -M-0 + L-M-0 = L. 



Next we use additivity of the upper integral. 



fb ra„ j-b„ rb j-b„ 

f= f+ f+ f<M{an,-a)+ f + M{b-bn,). 

Ja Ja Ja„i^ Jb„i^ J a„^, 

b 

We take the same subsequence f}'k=i t^ke the limit to obtain 




/<M-0 + L + M-0 = L. 



Thus /^f / = jj^f = L and hence / is Riemann integrable and /^f / = L. In particular, no matter what 
sequences {un} and {bn} we started with and what subsequence we chose the L is the same number. 

To prove the final statement of the lemma we use Theorem 2.3.7. We have shown that every con- 
vergent subsequence {/a„'^* /} converges to L = f. Therefore, the sequence {j^'^ /} is convergent 
and converges to L. □ 

Theorem 5.2.8. Let f: [a,b\ -^Rbe a bounded function with finitely many discontinuities. Then 
feM[a,b]. 

Proof. We divide the interval into finitely many intervals [a,,Z?/] so that / is continuous on the 
interior If / is continuous on (a,,Z?/), then it is continuous and hence integrable on [c,-, J,] 

for all a, < < di < bi. By Lemma 5.2.7 the restriction of / to [a,, bi\ is integrable. By additivity 
of the integral (and induction) / is integrable on the union of the intervals. □ 

Sometimes it is convenient (or necessary) to change certain values of a function and then 
integrate. The next result says that if we change the values only at finitely many points, the integral 
does not change. 

Proposition 5.2.9. Let / : [a, Z?] — M be Riemann integrable. Let g: [a, Z?] — )■ M be a function such 
that f{x) = g{x) for all x G [a, b] \ S, where S is a finite set. Then g is a Riemann integrable function 
and 




Sketch of proof. Using additivity of the integral, we split up the interval [a,b\ into smaller intervals 
such that f{x) = g{x) holds for all x except at the endpoints (details are left to the reader). 

Therefore, without loss of generality suppose that f{x) = g{x) for all x E {a,b). The proof 
follows by Lemma 5.2.7, and is left as an exercise. □ 
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5.2.4 Exercises 

Exercise 5.2.1; Let f be in ^[a,b]. Prove that —f is in ^[a,b] and 

rb rh 

/ —f{x) dx = — f{x) dx. 

Ja Ja 

Exercise 5.2.2: Let f and g be in ^[a,b]. Prove that f + g is in ^[a,b] and 

rb rb rb 

/ f{x)+g{x)dx= / f{x)dx+ / g{x) dx. 

Ja Ja J a 

Hint: Use Proposition 5.1.7 to find a single partition P such that U{P,f) —L{P,f) < e/2 and U{P,g) 
L{P,g)<^J^. 

Exercise 5.2.3: Let f: [a,b] be Niemann integrable. Let g: [a,b] be a function such that f{x) 
g{x)for all X G {a,b). Prove that g is Riemann integrable and that 



fh fb 
J a J a 



Exercise 5.2.4; Prove the mean value theorem for integrals. That is, prove that iff: [a,b] — )• M is continuous, 
then there exists a c £ [a,b] such that f = f{c){b — a). 

Exercise 5.2.5; Iff: [a,b] -^M.is a continuous function such that f{x) > Ofor all x G [a,b] and Ja f = 0. 
Prove that f{x) = Ofor all x. 

Exercise 5.2.6; If f: \a,b] is a continuous function for all x G [a,b] and Xf / = 0. Prove that there 
exists a c £ [a,b] such that f{c) = (Compare with the previous exercise). 

Exercise 5.2.7; If f: [a,Z>] — M and g: [a,b] — M are continuous functions such that f = g. Then 
show that there exists a c £ [a, ft] such that f{c) = g{c). 

Exercise 5.2.8; Let f £ Si[a,b]. Let a,P,Y be arbitrary numbers in [a,b] (not necessarily ordered in any 
way). Prove that 

i-y rP rr 
f= f+ f- 



Recall what f means ifb<a. 
Exercise 5.2.9; Prove Corollary 5.2.3. 

Exercise 5.2.10; Suppose that f: [a,b] is bounded and has finitely many discontinuities. Show that 
as a function ofx the expression \f{x)\ is bounded with finitely many discontinuities and is thus Riemann 
integrable. Then show that 

rh 

f[x) dx 



< I \f{x)\ dx. 

J a 
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Exercise 5.2.11 (Hard); Show that the Thomae or popcorn function (see Example 3.2.12) is Riemann 
integrable. Therefore, there exists a function discontinuous at all rational numbers (a dense set) that is 
Riemann integrable. 

In particular, define /: [0, 1] — )■ M by 




^/k ifx = '"/*: where m,k £N and m and k have no common divisors, 
ifx is irrational. 



Show that Jq f = 0. 

If 7 C M is a bounded interval, then the function 

(pf^^y-l^ if;cG/, 

1 otherwise, 

is called an elementary step function. 

Exercise 5.2.12; Let I be an arbitrary bounded interval (you should consider all types of intervals: closed, 
open, half-open) and a < b, then using only the definition of the integral show that the elementary step 
function (pj is integrable on [a,b\, and find the integral in terms of a, b, and the endpoints of I. 

When a function / can be written as 

n 

fix) = J^ak(pi,{x) 

k=\ 

for some real numbers ai , a2 , • • • , «n and some bounded intervals /i , /2 ,...,/„ , then / is called a step function. 

Exercise 5.2.13; Using the previous exercise, show that a step function is integrable on any interval [a, b]. 
Furthermore, find the integral in terms of a, b, the endpoints ofl^ and the Uk. 

Exercise 5.2.14; Let / : [a, Z?] — t- M be increasing, a) Show that f is Riemann integrable. Hint: Use a uniform 
partition; each subinterval of same length, b) Use part a to show that a decreasing function is Riemann 
integrable. c) Suppose h = f — g where f and g are increasing functions on [a,b]. Show that h is Riemann 
integrable^ 

Exercise 5.2.15 (Challenging); Suppose that f £ ^[a,b], then the function that takes x to \f{x)\ is also 
Riemann integrable on [a,b]. Then show the same inequality as Exercise 5.2.10. 

Exercise 5.2.16: Suppose that f : [a,b] ^M.and g: [a, b] are bounded, a) Show Ja [f + g) ^ Ja f + Ja g 
and jj^if + g) < Sl^f + Xfg- b) Find example f and g where the inequality is strict. Hint: f and g should not 
be Riemann integrable. 



*Such an h is said to be of bounded variation. 
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5.3 Fundamental theorem of calculus 

Note: 1.5 lectures 

In this chapter we discuss and prove the fundamental theorem of calculus. The entirety of 
integral calculus is built upon this theorem, ergo the name. The theorem relates the seemingly 
unrelated concepts of integral and derivative. It tells us how to compute the antiderivative of a 
function using the integral and vice-versa. 



5.3.1 First form of the theorem 

Theorem 5.3.1. Let F: [a,b] ^M.be a continuous function, differentiable on (a, b). Let f G^[a, b] 
be such that f{x) = F' (x) for x G {a,b). Then 



t f = F{b)-F{a). 

J a 



It is not hard to generalize the theorem to allow a finite number of points in [a, b] where F is not 
differentiable, as long as it is continuous. This generalization is left as an exercise. 

Proof. Let P = {xq.xi, . .. ,x„} be a partition of [a,b]. For each interval [jc,_i,jc,], use the mean 
value theorem to find a c, G such that 

f{ci)Axi = F'{ci){xi-Xi-i) =F{xi) -F{xi-i). 

Using the notation from the definition of the integral, we have < /(c,) < Mi and so 

MiAxi < F{xi) - F{xi-i) < MiAxi. 

We sum over i= 1 , 2, . . . , n to get 

£m,M <f^{F{xi)~F{x,^i)) < f^M,Ax,. 

i=l i=l i=l 

Notice that in the middle sum all the terms except the first and last cancel out and we end up with 
F{xn) — F{xo) = F{b) — F{a). The sums on the left and on the right are the lower and the upper 
sum respectively. So 

L{PJ)<F{b)-F{a)<U{PJ). 
We take the supremum of L{P,f) over all P and the left inequality yields 

tf<F{b)-F{a). 

J a 
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Similarly, taking the infimum of U {P,f) over all partitions P yields 

F{b)-F{a)< f'f. 

J a 



As / is Riemann integrable, we have 

rb rb 



/ / f<F{b)-F{a)< f. 

Ja Jg Ja Ja 

The inequalities must be equalities and we are done. 



□ 



The theorem is used to compute integrals. Suppose we know that the function f{x) is a derivative 
of some other function then we can find an explicit expression for f. 

Example 5.3.2: Suppose we are trying to compute 

/ dx. 
Jo 

We notice thatx^ is the derivative of y. We use the fundamental theorem to write 

/■I 2 , 1^ 0^ 1 

/ X dx= — z- = z^- 

Jo 3 3 3 

5.3.2 Second form of the theorem 

The second form of the fundamental theorem gives us a way to solve the differential equation 
F'{x) — f{x), where f{x) is a known function and we are trying to find an F that satisfies the 
equation. 

Theorem 5.3.3. Let f: [a, b]^M.be a Riemann integrable junction. Define 

Ja 

First, F is continuous on [a,b]. Second, iff is continuous at c & [a,b], then F is dijferentiable at c 
and F'{c) = f{c). 

Proof. As / is bounded, there is an M > such that \f{x) \ < M for all a: G [a, b] . Suppose x,y E [a, b] 
with x>y. Then 

\Fix)-F{y)\ 

By symmetry, the same also holds ifx<y. So F is Lipschitz continuous and hence continuous. 











/'/- /■'/ 






<M\x-y\ 


J a J a 




Jy 
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Now suppose that / is continuous at c. Let e > be given. Let 5 > be such that for x e [a, b] 
\x — c\<5 impUes |/(jc) — /(c) | < e. In particular, for such x we have 

/(c)-e</(x)</(c) + e. 

Thus if jc > c, then 

(/(c) -e){x-c)< jj < {f{c) + e){x-c). 
When c>x, then the inequalities are reversed. Therefore, assuming that cj^xwe get 

/(c)-e< J^</(c) + 8. 

As 

F(x)-F(c) _ /;/-/;/ _ /;/ 

Xi C X' C JC c 

we have that 



x — c 



The result follows. It is left to the reader to see why it is OK that we just have a non-strict 
inequality. □ 

Of course, if / is continuous on [a^b], then it is automatically Riemann integrable, F is differen- 
tiable on all of [a, b] and F' (x) = f{x) for all x e [a, b] . 

Remark 5.3.4. The second form of the fundamental theorem of calculus still holds if we let d e [a, b] 
and define 

F{x) [f. 

J a 

That is, we can use any point of [a^b] as our base point. The proof is left as an exercise. 

A common misunderstanding of the integral for calculus students is to think of integrals whose 
solution cannot be given in closed-form as somehow deficient. This is not the case. Most integrals 
we write down are not computable in closed-form. Even some integrals that we consider in closed- 
form are not really such. For example, how does a computer find the value of In jc? One way to do it 
is to note that we define the natural log as the antiderivative of i/x such that In 1 = 0. Therefore, 



Inx J i/i ds. 



Then we can numerically approximate the integral. So morally, we did not really "simpHfy" Jj* i/s ds 
by writing down \nx. We simply gave the integral a name. If we require numerical answers, it is 
possible that we end up doing the calculation by approximating an integral anyway. 
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Another common function defined by an integral that cannot be evaluated symbolically is the 
erf function, defined as 

2 _.2 

erf(jc) := —= / e ^ ds. 



This function comes up often in applied mathematics. It is simply the antiderivative of {^/-/n) e^^ 
that is zero at zero. The second form of the fundamental theorem tells us that we can write the 
function as an integral. If we wish to compute any particular value, we numerically approximate the 
integral. 



5.3.3 Change of variables 

A theorem often used in calculus to solve integrals is the change of variables theorem. Let us prove 
it now. Recall that a function is continuously differentiable if it is differentiable and the derivative is 
continuous. 

Theorem 5.3.5 (Change of variables). Let g: [a.,b\^M.be a continuously differentiable function. 
Ifg{[a,b]) C [c,d] and f: [c,d] — )■ M is continuous, then 

/ f{g{x))g\x)dx= / ns)ds. 

Ja Jg(a) 

Proof. As g, g' , and / are continuous, we know that f{g{x)) g'{x) is a continuous function on [a, b], 
therefore it is Riemann integrable. 
Define 

F{y) := r f{s) ds. 

By the second form of the fundamental theorem of calculus (using Exercise 5.3.4 below) F is a 
differentiable function and F'{y) = f{y). Now we apply the chain rule and write 

{Fogyix)=F'{g{x))g'{x)=f{g{x))g'{x). 

We note that F {g{a)) = and we use the first form of the fundamental theorem to obtain 

/ f{s)ds = F{g{b)) =F{g{b))-F{g{a)) = / {Fog) {x)dx= / f{g{x))g\x)dx. 

J g(a) Ja Ja 

□ 



The change of variables theorem is often used to solve integrals by changing them to integrals 
that we know or that we can solve using the fundamental theorem of calculus. 
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Example 5.3.6: From an exercise, we know that the derivative of sin(x) is cos(jc). Therefore we 
solve 

/■^ / 7x , f^cos(s) , I , , , sin(;r)-sin(0) ^ 

/ xcos x^) dx = / — ^ ds=- cos 5) ds = — ^ = 0. 

Jo ^ ' h 1 Ih ^ ' 2 

However, beware that we must satisfy the hypotheses of the theorem. The following example 
demonstrates why we should not just move symbols around mindlessly. We must be careful that 
those symbols really make sense. 

Example 5.3.7: Suppose we write down 

/•I hilxl , 
/ dx. 

J-l X 

It may be tempting to take g{x) :—\n\x\. Then take g'{x) — ^ and try to write 

/ s ds — s ds = 0. 
Jg{-1) Jo 

This "solution" is incorrect, and it does not say that we can solve the given integral. First problem is 
that is not continuous on [—1, 1]. Second, is not even Riemann integrable on [—1,1] (it is 
unbounded). The integral we wrote down simply does not make sense. Finally, g is not continuous 
on [—1, 1] either. 



5.3.4 Exercises 



■X 

s 



Exercise 5.3.1; Compute ^ / ds ]. 



Exercise 5.3.2: Compute ^( [ sin(s^) ds] 

dx \Jo J 



Exercise 5.3.3; Suppose F : [a,b] -^M. is continuous and dijferentiable on [a,b\\ S, where S is a finite set. 
Suppose there exists an f £ M[a,b] such that f (x) = F' (x) for x £ [a,b]\S. Show that f = F{b) — F{a). 

Exercise 5.3.4; Let f: [a,b] -^Rbe a continuous function. Let c G {a,b\ be arbitrary. Define 

F{x):=iy. 

Prove that F is differentiable and that F' (x) = f{x) for all x £ [a,b\. 

Exercise 5.3.5; Prove integration by parts. That is, suppose that F and G are continuously differentiable 
functions on [a,b\. Then prove 

f F{x)G'{x) dx = F{b)G{b)-F{a)G{a)- f F' {x)G{x) dx. 

J a J a 
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Exercise 5.3.6; Suppose that F and G are dijferentiahle functions defined on [a,b\ such that F'[x) = G'{x) 
for all X G [a, b\. Using the fundamental theorem of calculus, show that F and G differ by a constant. That is, 
show that there exists a C G M such that F{x) — G{x) = C. 

The next exercise shows how we can use the integral to "smooth out" a non-differentiable function. 

Exercise 5.3.7: Let f: [a,b\ — > M fee a continuous function. Let e >0 be a constant. Forx G [a + e,b — e], 
define 

a) Show that g is dijferentiable and find the derivative. 

b) Let f be differentiable and fix X ^ {a,b) (let £ be small enough). What happens to g' (x) as e gets smaller? 

c) Find g for fix) := \x\, £ = 1 (you can assume that {a,b\ is large enough). 

Exercise 5.3.8; Suppose that f: [a,b] — > M is continuous. Suppose that J^f = fforallx G [a,b]. Show 
that f{x) = Q for all X G [a,b\. 

Exercise 5.3.9; Suppose that f: [a, fe] — > M is continuous and J^f = Ofor all rational x in [a, b]. Show that 
f{x) = Ofor all X G [a,b]. 

Exercise 5.3.10; A function f is an odd function if f{x) = —f{—x), and f is an even function if f{x) = f{—x). 
Let a> 0. Assume f is continuous. Prove: a) If f is odd, then Jl^f t>) Vf even, then = 2 /q /. 

Exercise 5.3.11: a) Show that f{x) := sm(}/x) is integrable on any interval (you can define f{0) to be 
anything), b) Compute j\sm(^/x)dx. (Mind the discontinuity) 
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5.4 The logarithm and the exponential 

Note: 1 lecture ( optional, requires the optional sections §3.5, §3.6, §4.4) 

We now have all that is required to finally properly define the exponential and the logarithm that 
you know from calculus so well. First recall that we have a good idea of what means as long as n 
is a positive integer. Simply, 




n times 



It makes sense then to define x := \. For negative integers we define x^" := i/x". If x > we 
mentioned before that jc^/" is defined as the unique positive nth root. Finally for any rational number 
"/ra, we define 

However, what do we mean by y/l ? Or x^ in general? In particular, what is e^ for all xl This 
section answers these questions and more. 

5.4.1 The logarithm 

It is convenient to start with the logarithm. Let us show that a unique function with the right 
properties exists, and only then will we call it the logarithm. 

Proposition 5.4.1. There exists a unique Junction L : (0,°°) — > M such that 

(i) L(1)=0. 

(ii) L is differentiable and L'{x) = i/x. 

( Hi) L is strictly increasing, bijective, and 

YimL{x) = — oo, and limL(.x;) = oo. 

(iv) L{xy) =L{x)+L{y)forallx,ye (0,oo). 

(v) Ifq is a rational number then L{x'') = qL{x). 

Proof. To prove existence, let us define a candidate and show it satisfies all the properties. Define 

L{x) := - dt. 
J\ t 

Obviously (i) holds. Property (ii) holds via the fundamental theorem of calculus. 
To prove property (iv), we change variables u = yt io obtain 

rx I pxy I rxy I py J 

L(x) = - dt = - du= - du— - du = L(xy) — L(y) . 
ii ? Jy u Ji u Ji u 
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That L is strictly increasing and hence one-to-one follows from (ii) and the fact that L'{x) = 
i/x > for X > 0. Let us show that L is onto. First, as i/f > 1/2 if ? G [1,2], 

/■2 1 

L(2) = / -dt> 1/2. 
Ji t 

By induction, (iv) implies that for n e N 

L(2") = L(2) +L(2) + ■ ■ ■ +L(2) = nL{2). 

Given any y > 0, by the Archimedean property of the real numbers, (because L(2) > 0) there is 
an n G N such that L(2") > y. By the intermediate value theorem there is an x\ E (1,2") such that 
L(xi) = y. We get that (0, 0°) is in the image of L. As L is increasing L(x) > y for all x > 2", and so 

lim L{x) = 00. 

Next = L{^/x) = L{x) +L{i/x), and so L{x) = — L(i/x). Using x = 2^", we obtain as above that L 
achieves all negative numbers. And 

\imL{x) = lim— L(i/x) = lim —L{x) = —00. 

X^O X^O X^oa 

In the limits note that only .x: > are in the domain of L. 

Let us now prove (v). As above, (iv) implies for n G N we have L{x") = nL{x). We have already 
seen that L{x) = -L(J/7) so L{x-") ~-L{x") = -nL{x). Then for m G N 

L(;c) =l((x1/'")"') =mL(jci/'"). 

Putting everything together for n G Z and m G N we have L(jc"/'") = nL{x^/'^) = {n/m)L{x). 

Finally for uniqueness, let us use properties (i) and (ii). Via the fundamental theorem of calculus 

L{x) = [ - dt 
Ji t 

is the unique function such that L(l) =0 and L\x) = i/x. □ 

Now that we proved that there is a unique function with these properties we simply define the 
logarithm or sometimes called the natural logarithm: 

\n{x) :=L{x). 

Often mathematicians write log(.x;) instead of ln(.x;), which is more familiar to calculus students. 
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5.4.2 The exponential 

Just as with the logarithm we define the exponential via a list of properties. 

Proposition 5.4.2. There exists a unique function £ : M — (0, oo) such that 

(i) E{0) = 1. 

(ii) E is differentiable and E'{x) =E{x). 
( Hi) E is strictly increasing, bijective, and 

hm E{x)=0, and \imE{x)=oo, 



(iv) E{x + y)=E{x)E{y)forallx,yeR. 

(v) IfqeQ, then E{qx) =E{xy. 

Proof. Let us prove existence of such a function by defining a candidate again, and prove that 
it satisfies all the properties. As L defined above is invertible, let E be the inverse function of L. 
Property (i) is immediate. 

Property (ii) follows via the inverse function theorem, in particular Lemma 4.4.1. L satisfies all 
the hypotheses of the lemma and hence 

E'(x) = -^^——=E(x). 

Next let us prove property (iii). The function E is strictly increasing since E{x) > and 
E'{x) = E(x) > 0. As E is the inverse of L, it must also be bijective. To show the limits, notice that 
E{x) is increasing and onto (0, oo) . Then for every M > 0, there is an xq such that E{xq) = M and 
E(x) >M for all x>xo. Similarly for every £ > 0, there is an xq such that ^(jco) = £ and £'(xo) < £ 
for all X <xo. Therefore, 

lim E{x) = 0, and lim E{x) = oo. 

To prove property (iv) we use the corresponding property for the logarithm. Take G M. As L 
is bijective, find a and b such that x = L{a) and y = L{b). Then 

E{x + y)=E{L{a)+L{b)) =E{L{ab)) = ab = E{x)E{y). 

Property (v) also follows from the corresponding property of L. Given .x: G M, let a be such that 

X = L{a) and 

E{qx) =E{qL{a))E{L{a'^)) =a'i = E{xY. 
Finally, uniqueness follows from (i) and (ii). Let E and F be two functions satisfying (i) and (ii). 

^(f{x)E{-x)^ = F'{x)E{-x) ~ E'{-x)F{x) = F{x)E{-x) - E{-x)F{x) = 0. 
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Therefore by Proposition 4.2.5, F{x)E{—x) = F(0)£'(— 0) = 1 for all x G M. Doing the computation 
with F = E,we obtain E(x)E{—x) = 1. Then 

0=1-1= F{x)E{-x)-E{x)E{-x) = {F{x) -E{x))E{-x). 

Since E{x)E{—x) = 1, then £■(— .jc) ^ for all x. So F{x) —E{x) =0 for all x and we are done. □ 

Having proved that E is unique, we define the exponential function as 

exp(.x:) := E{x). 

We can now make sense of exponentiation x^ for arbitrary numbers when .x; > 0. First suppose 
that e Q. Then 

x^ = exp (in(x^) ) = exp {y \n{x) ) . 
Therefore when x> and y is irrational let us define 

x^ := exp()^ln(x)). 

As exp is continuous then # is a continuous function of y. Therefore, we would obtain the same 
result had we taken limits of rational numbers approaching y. 
Define the number e as 

e := exp(l). 

The number e is sometimes called Euler's number or the base of the natural logarithm. We notice 
that 

e^ = exp(xln(e)) = exp(x). 

We have justified the notation g-^ for exp(jc) . 

Finally, let us extend properties of logarithm and exponential to irrational powers. The proof is 
immediate. 

Proposition 5.4.3. Letx,y e M. 

(i) exp{xy) = (exp(x))^. 

(ii) lfx>0 then ln(x-^) = 3;ln(;c). 

5.4.3 Exercises 

Exercise 5.4.1; Let y be any real number and b > 0. Define f: (0, oo) — )• M and g: M — )■ R a^, f{x) := 
and g{x) := b"^. Show that f and g are dijferentiable and find their derivative. 

Exercise 5.4.2; Let b > be given. 

a) Show that for every y > 0, there exists a unique number x such that y = b^. Define the logarithm base b, 
log^ : (0,oo) ^ M, by log,,{y) := x. 

b) Show that log^(x) = 

c) Prove that ifoO, then log^(x) = 

d) Prove logi,{xy) = log,^{x) + log,^{y), and log^(x>') = ylog,,{x). 
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Exercise 5.4.3 (requires §4.3); Use Taylor's theorem to study the remainder term and show that for allxGM 

oo /J 

Hint: Do not differentiate the series term by term (unless you would prove that it works). 
Exercise 5.4.4; Use the geometric sum formula to show 

i-t+t''----+{-irt'^ = -^-^-^^. 

Using this fact show 

n=l 

for all X £ ( — 1,1] (notethatx= 1 is included). Finally find the limit of the alternating harmonic series 

y ^ ^ = 1 - 1/2+ 1/3 - 1/4+ • • • = ln2. 

n=l " 

Exercise 5.4.5; Show that 

= lim f 1 + ^V. 

Hint: Take the logarithm. 

Note: The expression (l + arises in compound interest calculations. It is the amount of money in a bank 
account after I year if I dollar was deposited initially at interest x and the interest was compounded n times 
during the year. Therefore e^ is the result of continuous compounding. 

Exercise 5.4.6; a ) Prove that for « G N we have 



n 1 n— 1 1 

k=2 k=\ 



b) Prove that the limit 

( " 

k 



exists. This constant is known as the Euler-Mascheroni constant*. It is not known if this constant is rational 
or not, it is approximately y « 0.5772. 



Exercise 5.4.7; Show 



lim = 0. 



A'^oo X 



* Named for the Swiss mathematician Leonh ard Paul Euler (1707 - 1783) and the Italian mathematician Lorenzo 
Mascheroni (1750- 1800). 
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Exercise 5.4.8: Show that is convex in the sense that if a <x<b then e^ < ^"^zf +^^fzf • 
Exercise 5.4.9: Using the logarithm find 

limni/". 

Exercise 5.4.10: Show that E{x) = e^ is the unique continuous fimction such that E(x + y) = E{x)E{y) and 
E(\) = e. Similarly prove thatL{x) = ln(x) is the unique continuous fimction defined on positive x such that 
L{xy) = L{x) +L{y) and L{e) = 1. 
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5.5 Improper integrals 

Note: 2-3 lectures (optional section, can safely be skipped, requires the optional §3.5) 

Often it is necessary to integrate over the entire real line, or a infinite interval of the form [a, oo) 
or (oo, b] . Also, we may wish to integrate functions defined on a finite interval (a, b) but not bounded. 
Such functions are not Riemann integrable, but we may want to write down the integral anyway in 
the spirit of Lemma 5.2.7. These integrals are called improper integrals, and are limits of integrals 
rather than integrals themselves. 

Definition 5.5.1. Suppose /: — )■ M is a function (not necessarily bounded) that is Riemann 
integrable on [a, c] for all c < Z?, then we define 



/'/:= lim /V, 

Ja c^b Ja 



if the limit exists. 

Suppose that /: [a,oo) — t- M is a function such that / is Riemann integrable on [a,c] for all 
c <oo. Then we define 

/■CO /.c 

/ /:=lim / /, 

J a J a 

if the limit exists. 

If the limit exists, we say that the improper integral converges. If the limit does not exist, we say 
that the improper integral diverges. 

We similarly define improper integrals for the left hand endpoint, we leave this to the reader. 

For a finite endpoint b, using Lemma 5.2.7 we see that if / is bounded, then we have defined 
nothing new. What is new is that we can apply this definition to unbounded functions. The following 
set of examples is so useful that we state it as a proposition. 

Proposition 5.5.2 (p-test for integrals). The improper integral 

— dx 
1 xP 

converges to if p> \ and diverges ifO< p < 1. 
The improper integral 

— dx 
xP 

converges to ifO<p<l and diverges if p> 1. 

Proof. The proof follows by application of the fundamental theorem of calculus. Let us do the 
proof for p > 1 for the infinite right endpoint, and we leave the rest to the reader. Hint: You should 
handle p = I separately. 



5.5. IMPROPER INTEGRALS 



111 



Suppose that p> I. Then 

rb I rb lj-p+\ 1 1 

/ — dx= I X P dx= = — 7 — - — H . 

J\ xP Jl -p+l -p+l (p-l)bP-^ p-l 

As /? > 1, then p—l > 0. Taking the limit as ^ oo we obtain that goes to 0, and the result 
follows. □ 

We state the following proposition for just one type of improper integral, though the proof is 
straight forward and the same for other types of improper integrals. 

Proposition 5.5.3. Let f: [a, oo) — > R be a function that is Riemann integrable on [a, b] for all b> a. 
Given any b> a, f converges if and only if f converges, in which case 

i-oo i-b pea 

/ /=//+/ /• 

Ja Ja Jb 

Proof Let c > b. Then 

/"/=/"/+/"/. 

Ja Ja Jb 

Taking the limit c — > £» finishes the proof. □ 

Nonnegative functions are easier to work with as the following proposition demonstrates. The 
exercises will show that this proposition holds only for nonnegative functions. Analogues of this 
proposition exist for all the other t)^es of improper limits are left to the student. 

Proposition 5.5.4. Suppose that f: [a, oo) — )■ M is nonnegative (f{x) > Ofor all x) and such that f 
is Riemann integrable on [a, b] for all b > a. 

(i) 

J / = sup|y /:j:>a|. 

( ii) Suppose that {xn} is a sequence with lim Xn = °°. Then f converges if and only if \im J^" f 
exists in which case 

/ /=lim / /. 

In the first item we allow for the value of oo in the supremum indicating that the integral diverges 
to infinity. 

Proof. Let us start with the first item. Notice that as / is nonnegative, then f^f is increasing as a 
function of x. If the supremum is infinite, then for every M G M we find such that f^ f > M. As 
f^f is increasing then f^f>M for all x>N. So f diverges to infinity. 
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Next suppose that the supremum is finite, say A = sup {J^ f : x > a}. For every e > 0, we can 
find an A^^ such that A — f < £. As faf^s. increasing, then A — f < £ for all x > N and hence 
converges to A. 

Let us look at the second item. If f converges then every sequence {xn} going to infinity 
works. The trick is proving the other direction. Suppose that is such that lim Xn = °° and 



converges. Given e > 0, pick A'^ such that for all n > we have A — e < J^" f < A + £. Because 
/^/ is increasing as a function of x, we have that for all x>X!^ that 



As {xn} goes to oo, then for any given x, there is an Xm such that m>N and x < x,n- Then 



Proposition 5.5.5 (Comparison test for improper integrals). Let f: [a, oo) — )• M and g: [a, oo) — )■ M 

be functions that are Riemann integrable on [a, b] for all b > a. Suppose that for all x> a we have 



(i) If g converges, then f converges, and in this case < g- 

(ii) If f diverges, then g diverges. 

Proof. Let us start with the first item. For any b and c, such that a < b < c, we have —g{x) < 
fix) < g{x), and so 






In particular, for all x>xn we have f —A\ < £. 



□ 



1/(^)1 <^(^)- 




In other words, |/^'7I < IbE- 

Let e > be given. Because of Proposition 5.5.3 we have 




As g goes to g as b goes to infinity, then g goes to as goes to infinity. Choose B such 



that 
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As g is positive, then if B < b < c, then /^g < e as well. Let {jc„} be a sequence going to infinity. 
Let M be such that Xn>B for all n>M. Take n,m>M, with jc„ < j:^. 



y ^ ^ y ^ -y 

Ja Ja Jx„ Jx. 



< e. 



Therefore the sequence {/^" is Cauchy and hence converges. 

We need to show that the limit is unique. Suppose {x„} is a sequence converging to infinity 
such that {/^" /} converges to Li, and {yn} is a sequence converging to infinity is such that {/J" /} 
converges to L2 . Then there must be some n such that | f — Li \ < e and | / — L2 1 < e. We can 
also suppose that Xn>B and j„ > 5. Then 





fXn 








ryn 




ryn 




U- f 


+ 


y ^- ^ 


+ 


/ f-Li 


< e + 


y ^ 




Ja 




J a J a 




J a 




JXn 



+ e < 3e. 



(5.4) 



As e > was arbitrary, Li = L2, and hence /^°°/ converges. Above we have shown that |/^/| < ^ 
for all c > a. By taking the limit c — > 00, the first item is proved. 

The second item is simply a contrapositive of the first item. □ 



Example 5.5.6: The improper integral 



Jo 



X 



3 + 1 



dx 



converges. 

Proof: First note that we simply need to show that the integral converges when going from 1 to 
infinity. For x > 1 we obtain 



sm{x^){x + 2) 



x^ + l 



x + 2 x + 2 x + 2x 3 

< < — ^ < — ^ < 



Then 



;c3 + l 

» / \ dx — lim / 

Ji x^ c-^°°Ji 



X-' 



.2- 



dx. 



x^ 



So the integral converges. 



Example 5.5.7: You should be careful when doing formal manipulations with improper integrals 
For example, 

dx 



wnte 



2 x^-l 

converges via the comparison test again using ^. However, if you succumb to the temptation to 

2 



1 



1 



X- 



^ — 1 x—l x+1 
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and try to integrate each part separately, you will not succeed. It is not true that you can split the 
improper integral in two; you cannot split the limit. 



The last line in the computation does not even make sense. Both of the integrals there diverge to 
infinity since we can apply the comparison test appropriately with i/x. We get oo — oo. 

Now let us suppose that we need to take limits at both endpoints. 

Definition 5.5.8. Suppose /: — )■ M is a function that is Riemann integrable on [c,d] for all c, 
d such that a < c < d <b, then we define 



if the limits exist. 

Suppose that / : R — > R is a function such that / is Riemann integrable on all finite intervals 
[a,b]. Then we define 



if the limits exist. 

We similarly define improper integrals with one infinite and one finite improper endpoint, we 
leave this to the reader. 

One ought to always be careful about double limits. The definition given above says that we 
first take the limit as d goes to or oo for a fixed c, and then we take the Umit in c. We will have to 
prove that in this case it does not matter which limit we compute first. 

Example 5.5.9: Let us see an example: 






/ := lim lim 




1 



Ja 



■b I 



dx = lim hm (arctan(&) — arctan(fl)) = Ti. 



1+JC2 



dx — lim lim 



1+JC2 



— oo 



Z?— >oo 



In the definition the order of the limits can always be switched if they exist. Let us prove this 
fact only for the infinite Umits. 
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Proposition 5.5.10. Iff : M ^ M /5 a function integrable on every interval. Then 

rb rb 
a lira / 



rb rb 

lim lira / / converges if and only if lira lira / / converges, 



in which case the two expressions are equal. If either of the expressions converges then the improper 
integral converges and 

pa poo 

lim / / /. 



a— >oo 



Proof. Without loss of generality assume a < and b>0. Suppose the first expression converges. 
Then 

Um lim lim lim ( f f+ t f\ = ( lim f] + (lim f] 

a-^-oo h^co J a a^-°o b-^oo\Ja Jo ) \c^-°°Ja ) \b-^<=oJo ) 

= lim lim ['f) + f f\ = lim lim ( ff+ f f\ . 

Similar computation shows the other direction. Therefore, if either expression converges then the 
improper integral converges and 



/ = lim lim / / = lim / / + lim / / 

-oo a^-oo h^oo Ja \a^-°°Ja J \b-^oojQ ^ 

= (lim f f\ + f lim rf\ = lim( r f+ [" f] = lim f" f. 



□ 



Example 5.5.11: On the other hand, you must be careful to take the limits independently before 
you know convergence. Let f{x) = A for x and /(O) = 0. If a < and & > 0, then 



rb rO rb 

f= f+ f = a + b. 

Ja Ja Jo 



For any fixed a <0 the limit as — )■ 0° is infinite, so even the first limit does not exist, and hence 
the improper integral of / from —00 to 00 does not converge. On the other hand if a > 0, then 



Therefore, 



f f^{-a)+a = 0. 

J— a 

lim r f = 0. 



182 



CHAPTER 5. THE RIEMANN INTEGRAL 



Example 5.5.12: An example to keep in mind for improper integrals is the so-called sine function* . 
This function comes up quite often in both pure and applied mathematics. Define 




4 

Figure 5.2: The sine function. 



It is not difficult to show that the sine function is continuous at zero, but that is not important 
right now. What is important is that 

/oo poo 
smc{x)dx = 7t, while / \smc{x)\ dx = °°. 
-oo J — oo 

The integral of the sine function is a continuous analogue of the alternating harmonic series £ {-'^T/n, 
while the absolute value is like the regular harmonic series V"- particular, the fact that the 
integral converges must be done directly rather than using comparison test. 

We will not prove the first statement exactly. Let us simply prove that the integral of the sine 
function converges, but we will not worry about the exact hmit. Because = ^^y^' it is enough 

to show that 

sin(jc) , 

dx 

In X 

converges. We also avoid x = this way to make our life simpler. 
For any n G N, we have that for x E [n2n, n{2n + 1)] 

sin(jc) ^ sin(x) ^ sin(x) 



7r{2n + \) X nln 



* Shortened from Latin: sinus cardinalis 
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as sin(x) > 0. Onx e [n{2n+ l),7t{2n + 2)] 



sin(x) ^ sin(x) ^ sin(x) 



;r(2n+l) ~ x ~ ;r(2n + 2)' 



as sm{x) < 0. 

Via the fundamental theorem of calculus, 



^(2«+i) sin(x) 



;r(2n + l) Jnin :?r(2n+l) 
Similarly 



dx < 



Kin 



X 



/•^(2n+i) sin(A;) , 1 
dx< / dx= —. 

Jnln Ttln Ttn 



< 



n{2n+2) sin(x) 

7r(2n+ 1) ~ A(2n+i) X 
Putting the two together we have 



dx < 



Tt{n+ 1)' 







2 2 r 

;r(2n+l) ~ ;r(2n + l)^ - A 



1 



1 



dx< -. = — 



:{2n+l) 7t{2n 

Let M > 2;r be arbitrary, and let /: e N be the largest integer such that 2k7t < M. Then 



r^smix) , sin(;c) , 

/ — —dx= — —dx + 

Jin X JlK X JlkK 



sin(;c) 



dx. 



2kK X 



Forjc e [2kn,M] we have that 2^ < ^^^^ < and so 



M 



sm{x) 



dx 



2k7C X 



^ M-2kTi ^ 1 
~ 2kn ~ k' 



As k is the largest k such that 2k7t < M, this term goes to zero as M goes to infinity. 
Next 

'•2*'^ sin(x) 



0< 



1 

2k X ~ ^i^n{n+l)' 



2l 



and this series converges as ^ — > 00. 

Putting the two statements together we obtain that 

' sin(jc) 



X 



„=i^«(« + l) 



< 00. 



The double sided integral of sine also exists as noted above. We leave the other statement — that 
the integral of the absolute value of the sine function diverges — as an exercise. 
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5.5.1 Integral test for series 

It can be very useful to use the fundamental theorem of calculus in proving a series is summable 
and to estimate its sum. 

Proposition 5.5.13. Suppose that f: [k, oo) — )■ M is a decreasing nonnegative function where k eZ. 
Then 



n=k 

In this case 



V f(n) converges if and only if if converges. 

Jk 



poo °° POO 

/ /<£/W</(fc)+/ /. 



By Exercise 5.2.14, / is integrable on every interval [k,b] for all b> k, so the statement of the 
theorem makes sense without additional hypotheses of integrability. 

Proof. Let £ > be given. And suppose that /^/converges. Let £, m G Z be such that m> i>k. 
Because / is decreasing we have /"^^ / < f{n) < /• Therefore 



/ /=I/ /<i:/w</w+I / f<m+ /• (5.5) 

J^ n=lJ" n=£ „=^+l-^«-l -'i 



As before, since / is positive then there exists an L e N such that if £ > L, then JP f < e/2 for all 
m> i. We note that / must decrease to zero (why?). So let us also suppose that for £ > L we have 
/(£) < £/2. For such i and m we have via (5.5) 



£/W </(£)+ / f<e/2 + e/2<e. 



The series is therefore Cauchy and thus converges. The estimate in the proposition is obtained by 
letting m go to infinity in (5.5) with £ — k. 

Conversely suppose that /^°°/ diverges. As / is positive then by Proposition 5.5.4, the sequence 
f}Z=k diverges to infinity. Using (5.5) with £ = kwe find 



rm in— I 

/ /< I/W- 

Jk 



m— 1 

E- 

n=k 

As the left hand side goes to infinity as m — )■ oo, so does the right hand side. □ 



Example 5.5.14: Let us show that Y.n=i ^ exists and let us estimate its sum to within 0.01. As 
this series is the p-series for = 2, we already know it converges, but we have only very roughly 
estimated its sum. 
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Using fundamental theorem of calculus we find that for G N we have 

-1^ dx= -. 
k x-^ k 

In particular, the series must converge. But we also have that 

1 r \ , ^1 1 Z""' 1 , 1 1 

Adding the partial sum up to — 1 we get 

1 1 1 1 1 'y ]_ 

n—l n=l n=l 

In Other words, iA + L^=i V"^ estimate for the sum to within i/yt^. Therefore, if we wish to 
find the sum to within 0.01, we note that i/io^ = 0.01. We obtain 

1 ^1 °° 1 1 1 ^1 
1.6397...^-^+y:4< E4<T^ + ^+E4^1-6497.... 
10 ~ '^,rp-~ 100 10 

n=l 11=1 11= I 

In fact, the actual sum is ^^/e ^ 1 .6449 .... 
5.5.2 Exercises 

Exercise 5.5.1: Finish the proof of Proposition 5.5.2. 

oo 

Exercise 5.5.2; Find out for which a G M does ^ e"'^ converge. When the series converges, find an upper 

n=l 

bound for the sum. 

oo 

Exercise 5.5.3: a) Estimate ^ n{n+\) ^^^^^(^f within 0.01 using the integral test, b) Compute the limit of 

n=\ 

the series exactly and compare. Hint: the sum telescopes. 
Exercise 5.5.4; Prove that 

poo 

IsincU)! dx = oo. 



Hint: again, it is enough to show this on just one side. 
Exercise 5.5.5; Can you interpret 



— ^ dx 
-1 \/\x\ 



as an improper integral? If so, compute its value. 
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Exercise 5.5.6; Take f : [0, oo'j — > Riemann integrable on every interval [0, b], and such that there exist M, 
a, and T, such that |/(/)| < Me"' for all t >T. Show that the Laplace transform off exists. That is, for every 
s > a the following integral converges: 



poo 

\s):= / fit)e-^'dt. 
Jo 



Exercise 5.5.7: Let f: W—^W be a Riemann integrable function on every interval [a,b], and such that 
XZo \f{x)\ dx < oo. Show that the Fourier sine and cosine transforms exist. That is, for every (0>0 the 
following integrals converge 

F\(o) := - / /(?)sin(G)?) dt, E'^{(o) := - f{t) cos{(Ot) dt. 

Furthermore, show that F^ and F^ are bounded functions. 

Exercise 5.5.8; Suppose f: [0,oo) — > M is Riemann integrable on every interval \fd,b]. Show that Jq f 
converges if and only if for every £ > there exists an M such that ifM <a<b then 



taf 



< e. 



Exercise 5.5.9; Suppose that f: [0,oo) — R is nonnegative and decreasing, a) Show that if Jq f < °°, then 
lim fix) = 0. b) Show that the converse does not hold. 

Exercise 5.5.10; Find an example of an unbounded continuous function /: [0,oo) — t- R that is nonnegative 
and such that Jq f <°°. Note that this means that lim;c_j.„/(ji;) does not exist; compare previous exercise. 
Hint: on each interval [k,k+ 1], ^ G N, define a function whose integral over this interval is less than say 2~^. 

Exercise 5.5.11 (More challenging); Find an example of a function f : [0,oo) — ). R integrable on all intervals 
such that lim„_).oo /q / converges as a limit of a sequence, but such that Jq f does not exist. Hint: for all 
n G N, divide [n,n+ 1] into two halves. In one half make the function negative, on the other make the function 
positive. 

Exercise 5.5.12: Show that if f: [1,°°) — > K is such that g{x) := x^f{x) is a bounded function, then 

converges. 

It is sometimes desirable to assign a value to integrals that normally cannot be interpreted as even 
improper integrals, e.g. f\ i/x dx. Suppose /: [a,b] — ^ R is a function and a<c <b where / is Riemann 
integrable on all intervals [a, c — e] and [c + £, Z?] for all £ > 0. Define the Cauchy principal value of f as 



if the limit exists. 

Exercise 5.5.13; a) Compute p.v.j\ i/x dx. 

b) Compute \im^^Q+ [jZi ^/x dx + ^■^) show it is not equal to the principal value. 

c) Show that iff is integrable on [a,b], then p.v.J^ f = f. 

d) Find an example of an f with a singularity at c as above such that P-v.J^ f exists, but the improper 
integrals f and J^. f diverge. 

e) Suppose that f: [—1,1] ffi is continuous. Show that p.v.j\ dx exists. 
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Exercise 5.5.14; Let / : M — >■ M and g: M — >■ M fee continuous functions, where g{x) = Ofor all x ^ [a, b] for 
some interval [a,b]. 

a) Show that the convolution 

/oo 
fit)gix-t)dt 
-oo 

is well-defined for all x G M. 

b) Suppose that J^^ \ fix)\ dx < o°. Prove that 

lim (g * /) (x) = 0, and lim (g * /) (x) = 0. 
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Chapter 6 

Sequences of Functions 



6.1 Pointwise and uniform convergence 

Note: 1-1.5 lecture 

Up till now when we talked about sequences we always talked about sequences of numbers. 
However, a very useful concept in analysis is to use a sequence of functions. For example, a solution 
to some differential equation might be found by finding only approximate solutions. Then the real 
solution is some sort of limit of those approximate solutions. 

When talking about sequences of functions, the tricky part is that there are multiple notions of a 
limit. Let us describe two common notions of a limit of a sequence of functions. 

6.1.1 Pointwise convergence 

Definition 6.1.1. For every n G N let /„ : 5 — )► M be a function. We say the sequence 
converges pointwise to / : S — )■ M, if for every x G 5 we have 

f{x) = limfn{x). 

It is common to say that /„ : 5 — )■ M converges to f onT cM. for some / : T — )■ M. In that case 
we, of course, mean that f{x) = lim fn{x) for every x eT. We simply mean that the restrictions of 
/„ to T converge pointwise to /. 

Example 6.1.2: The sequence of functions defined by fn{x) := x^" converges to /: [— 1, 1] — )■ M 
on [—1,1], where 




1 if X = — 1 or .)c = 1, 
otherwise. 



See Figure 6.1. 
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— ^^-^ — =z — I 

Figure 6.1: Graphs of /i, fi, h, and f% for fn{x) := :c^". 

To see that this is so, first take x e (—1,1). Then < < 1. We have seen before that 

|x2"-0| = (x2)"^0 as n^oo. 

Therefore lim = 0. 

When x=lorjc=— 1, then x^" = 1 for all n and hence lim /„ (jc) = 1 . We also note that {/„ (jc) } 
does not converge for all other x. 

Often, functions are given as a series. In this case, we use the notion of pointwise convergence 
to find the values of the function. 

Example 6.1.3: We write 

oo 

to denote the limit of the functions 

When studying series, we have seen that on x e (— 1, 1) the /„ converge pointwise to 

1 

The subtle point here is that while is defined for all x 7^ 1, and /„ are defined for all x (even 
atx = 1), convergence only happens on (—1, 1). 
Therefore, when we write 

00 

we mean that / is defined on ( — 1 , 1 ) and is the pointwise limit of the partial sums. 
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Example 6.1.4: Let fn{x) := sm{xn). Then /„ does not converge pointwise to any function on any 
interval. It may converge at certain points, such as when x = or x = n. It is left as an exercise that 
in any interval [a, b], there exists an x such that sm{xn) does not have a limit as n goes to infinity. 

Before we move to uniform convergence, let us reformulate pointwise convergence in a different 
way. We leave the proof to the reader, it is a simple application of the definition of convergence of a 
sequence of real numbers. 

Proposition 6.1.5. Let f„: 5 — )■ M and / : 5 — )■ M be functions. Then {/„} converges pointwise to f 
if and only if for every x E S, and every e > 0, there exists anN eN such that 

\fn{x)-f{x)\<e 

for all n>N. 

The key point here is that can depend on x, not just on e. That is, for each x we can pick a 
different A^. If we can pick one for all x, we have what is called uniform convergence. 



6.1.2 Uniform convergence 

Definition 6.1.6. Let f-,: 5 — t- M be functions. We say the sequence {/„} converges uniformly to 
/ : 5 — 7- M, if for every £ > there exists anN eN such that for all n > A'^ we have 

\f„{x) - f{x)\ < £ for all 5. 

Note that A^ now cannot depend on x. Given e > we must find an A^^ that works for all x E S. 
Because of Proposition 6.1.5, we see that uniform convergence implies pointwise convergence. 

Proposition 6.1.7. Let {/„} be a sequence of functions fn'- 5 — )• M. If {ft} converges uniformly to 
f: 5 — > M, then {/„} converges pointwise to f. 

The converse does not hold. 

Example 6.1.8: The functions fn{x) :=x^" do not converge uniformly on [—1,1], even though they 
converge pointwise. To see this, suppose for contradiction that they did. Take £ := 1/2, then there 
would have to exist an A^ such that x^^ < 1/2 for all x E [0, 1) (as f„{x) converges to on ( — 1, 1)). 
But that means that for any sequence {xk} in [0, 1) such that \imxk = 1 we have xf^ < 1/2. On the 
other hand x^^ is a continuous function of x (it is a polynomial), therefore we obtain a contradiction 

l = l2A'=lim;cf <l/2. 

However, if we restrict our domain to [—a, a] where < a < 1, then /„ converges uniformly to 
on [—a, a]. Again to see this note that a^" — i- as n — )■ 00. Thus given e > 0, pick N eN such that 
a^" < £ for all n > A^. Then for any x E [—a, a] we have \x\ < a. Therefore, for n>N 

U2^| = M2^<a2^<£. 
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6.1.3 Convergence in uniform norm 

For bounded functions there is another more abstract way to think of uniform convergence. To every 
bounded function we assign a certain nonnegative number (called the uniform norm). This number 
measures the "distance" of the function from 0. Then we can "measure" how far two functions 
are from each other. We can then simply translate a statement about uniform convergence into a 
statement about a certain sequence of real numbers converging to zero. 

Definition 6.1.9. Let /: 5 ^ R be a bounded function. Define 



is called the uniform norm. 

Proposition 6.1.10. A sequence of bounded functions fn'. 5 ^ M converges uniformly to f: 5 ^ M, 
if and only if 



Proof First suppose that lim — / = 0. Let e > be given. Then there exists an A'^ such that 
for n>N we have ||/m — < £■ As ||/n — is the supremum of |/n(x) — we see that for 
all X we have \fn{x) — f{x) \ < e. 

On the other hand, suppose that /„ converges uniformly to /. Let e > be given. Then find 
such that \fn{x) —f{x)\ < £ for all xe S. Taking the supremum we see that ||/n — / ||« < £• Hence 



Sometimes it is said that /„ converges to f in uniform norm instead of converges uniformly. 
The proposition says that the two notions are the same thing. 

Example 6.1.11: Let /„: [0, 1] ^ M be defined by f„{x) := Then we claim that /„ 

converge uniformly to f{x) := x. Let us compute: 



||/||,:=sup{|/W|:xe5}. 



lim||/„-/||„ = 0. 

n— >oo 



lim||/„-/|L = 0. 



□ 



ll/n-/IL = SUp 



+ sin(n^ ) 



-jc : jc e [0, 1] 



n 




Using uniform norm, we define Cauchy sequences in a similar way as we define Cauchy 
sequences of real numbers. 
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Definition 6.1.12. Let /„ : 5 — )> M be bounded functions. We say that the sequence is Cauchy in the 
uniform norm or uniformly Cauchy if for every e > 0, there exists an A/^ e N such that for m,k>N 
we have 

||/m-AL<e- 

Proposition 6.1.13. Let /„: S be bounded functions. Then {/„} is Cauchy in the uniform 
norm if and only if there exists an f: 5 ^ M and {/„} converges uniformly to f. 

Proof. Let us first suppose that {/„} is Cauchy in the uniform norm. Let us define /. Fix x, then 
the sequence is Cauchy because 

|/mW-/fcWI<||/«-ML. 

Thus {fn{x)} converges to some real number so define 

f{x) := lim/„(x). 

Therefore, /„ converges pointwise to /. To show that convergence is uniform, let e > be given. 
Pick a positive e' < e and find an such that for m^ky N wc have — /^H < e'. In other words 
for all X we have \ fm{x) — fk{x) \ < e' . We take the limit as k goes to infinity. Then \ fm{x) — fk{x) \ 
goes to \fm{x) — f{x)\- Therefore for all x we get 

\fm{x)-f{x)\<e'<e. 

And hence /„ converges uniformly. 

For the other direction, suppose that {/„} converges uniformly to /. Given e > 0, find A'^ such 
that for all n > we have \fn{x) — /(•^) | < for all x e 5. Therefore for all m, A: > A^ we have 

\fm{x) - fk{x)\ = \f,n{x)-f{x) + f{x) - fk{x)\ < \f^{x) - f{x)\ + \f{x)-fk{x)\ < e/4 + e/4. 

Take supremum over all x to obtain 

\\fm-fk\\u<'/^<£- 

□ 

6.1.4 Exercises 

Exercise 6.1.1; Let f and g be bounded functions on [a, b\. Show that 

\\f + 8\\u<\\f\\u+\\s\\u- 
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Exercise 6.1.2: a) Find the pointwise limit for x G M. 

n 

b) Is the limit uniform on M? 

c) Is the limit uniform on [0, 1] ? 

Exercise 6.1.3; Suppose fi' 5 — )• M are functions that converge uniformly to f: 5 — )• M. Suppose that A C S. 
Show that the sequence of restrictions {/,j|a} converges uniformly to /|a- 

Exercise 6.1.4; Suppose that {/„} and {gn} defined on some set A converge to f and g respectively pointwise. 
Show that {/„ +gn} converges pointwise to f + g. 

Exercise 6.1.5; Suppose that {/„} and {gn} defined on some set A converge to f and g respectively uniformly 
on A. Show that {f„ + gn } converges uniformly to f + g on A. 

Exercise 6.1.6; Find an example of a sequence of functions {/„} and {gn} that converge uniformly to some 
f and g on some set A, but such that {fngn} ( the multiple ) does not converge uniformly to fg on A. Hint: Let 
A := M, let f{x) := g{x) := x. You can even pick fn = gn- 

Exercise 6.1.7; Suppose that there exists a sequence of functions {gn} uniformly converging to on A. Now 
suppose that we have a sequence of functions {fn} and a function fonA such that 

\fn{x)-f{x)\<gn{x) 

for all X GA. Show that {fn} converges uniformly to f on A. 

Exercise 6.1.8; Let {fn}, {gn} cmd {hn} be sequences of functions on \a,b\. Suppose that {/„} and {/j,,} 
converge uniformly to some function f: [a , Z?] — )• M and suppose that fn [x) < gn {^^^ ^ hn{x) for all x G [a,b]. 
Show that {gn} converges uniformly to f. 

Exercise 6.1.9; Let fn : [0, 1] — )• R Z?e a sequence of increasing functions (that is fn{x) > fn{y) whenever 
x>y)- Suppose that /(O) = and that lim =0. Show that {/„} converges uniformly to 0. 

Exercise 6.1.10; Let {fn} be a sequence of functions defined on [0, 1]. Suppose that there exists a sequence 
of numbers Xn G [0, 1] such that 

fn ip^n ) — 1 • 

Prove or disprove the following statements: 

a) True or false: There exists {fn} as above that converges to pointwise. 

b) True or false: There exists {fn} as above that converges to uniformly on [0, 1]. 

Exercise 6.1.11; Fix a continuous h: [a,b] — )• M. Let f{x) = h[x) for x G [a,b\, f{x) = h{a) for x < a and 
f{x) = h{b) for all x > b. First show that /: R — )• M w continuous. Now let fn be the function g from 
Exercise 5.3.7 with s = i/n, defined on the interval [a,b\. Show that {fn} converges uniformly to h on [a^b\. 
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6.2 Interchange of limits 

Note: 1-1.5 lectures 

Large parts of modern analysis deal mainly with the question of the interchange of two limiting 
operations. It is easy to see that when we have a chain of two limits, we cannot always just swap the 
limits. For example, 

= lim ( lim ) ^ lim ( lim ) = 1. 

When talking about sequences of functions, interchange of limits comes up quite often. We treat 
two cases. First we look at continuity of the limit, and second we look at the integral of the limit. 



6.2.1 Continuity of the limit 

If we have a sequence of continuous functions, is the limit continuous? Suppose that / is the 
(pointwise) limit of /„. If lim = x we are interested in the following interchange of limits. The 
equality we have to prove (it is not always true) is marked with a question mark. In fact the limits to 
the left of the question mark might not even exist. 

lim f{xk) = lim ( lim /„(x^) ) = lim (lim fnixk)) = lim f„{x) = f{x). 

In particular, we wish to find conditions on the sequence {/„} so that the above equation holds. It 
turns out that if we only require pointwise convergence, then the limit of a sequence of functions 
need not be continuous, and the above equation need not hold. 

Example 6.2.1: Let : [0, 1] M be defined as 



fn{x) :-- 



l—nx iix < 
ifx>i/«. 



See Figure 6.2 . 

Each function /„ is continuous. Fix an x G (0, 1]. If n > i/x, then x > i/n. Therefore for n > i/x 
we have fn{x) — 0, and so 

lim/„(x) =0. 

On the other hand if x = 0, then 

lim/„(0) = lim 1 = 1. 
Thus the pointwise hmit of /„ is the function /: [0, 1] — )► M defined by 



The function / is not continuous at 0. 



1 ifx = 0, 
ifx>0. 



196 CHAPTER 6. SEQUENCES OF FUNCTIONS 




Figure 6.2: Graph of fn{x). 



If we, however, require the convergence to be uniform, the limits can be interchanged. 

Theorem 6.2.2. Let {/„} be a sequence of continuous functions fn'. S ^ M converging uniformly 
to f: 5 ^ M. Then f is continuous. 

Proof. Let x G S be fixed. Let {x„} be a sequence in S converging to x. 

Let e > be given. As f^ converges uniformly to /, we find a e N such that 

\fk{y)-f{y)\<'^h 

for all y e 5. As fj^ is continuous at x, we find an A/^ e N such that for m>N we have 

\fk{Xm) - fk{x)\ < 

Thus for m>N we have 

\f{Xm)-f{x) \ = \fiXm) - fkiXm) + fk{Xm) - fkix) + AW - 

< \f{Xm)-fkiXm)\ + \fk{Xm)-fk{x)\ + \fk{x) - f{x)\ 
<e/3 + e/3 + e/3 = e. 

Therefore {f{xm)} converges to f{x) and hence / is continuous at x. As x was arbitrary, / is 
continuous everywhere. □ 

6.2.2 Integral of the limit 



Again, if we simply require pointwise convergence, then the integral of a limit of a sequence of 
functions need not be equal to the limit of the integrals. 
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Example 6.2.3: Let : [0, 1] M be defined as 



See Figure 6.3. 



ifx = 0, 

f„{x) := n — n^x if < X < 

ifx>l/n. 




Figure 6.3: Graph of fn{x) 



Each /„ is Riemann integrable (it is continuous on (0, 1] and bounded), and it is easy to see 

/ fn= {n — nx)dx=^/2. 
Jo Jo 

Let us compute the pointwise limit of {/,,}. Fix an jc G (0, 1]. For n > i/x we have x > i/n and so 
fn{x) = 0. Therefore 

\imfn{x) =0. 

We also have /„(0) = for all n. Therefore the pointwise limit of {/„} is the zero function. Thus 
i/2=lim / fn{x)dx^ I (lim/„(jc)) dx= / Odx = 0. 

n^oojQ Jq \n^oo ) Jq 

But if we again require the convergence to be uniform, the limits can be interchanged. 

Theorem 6.2.4. Let {/„} be a sequence of Riemann integrable functions f„: [a,b] -^M. converging 
uniformly to f: [a,b] ^M.. Then f is Riemann integrable and 



rb rb 

/ /=lim / /„. 

J a n^'^Ja 
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Proof. Let e > be given. As /„ goes to / uniformly, we find an M G N such that for all n > M 
we have \ fn{x) — f{x)\ < 2[b-a) ^ ^ [a,b]. In particular, by reverse triangle inequality 

\f{x) I < 2{b-a) I ^'-'^ ^' hence / is bounded as /„ is bounded. Note that /„ is integrable 

and compute 



/- / / 



(/(■«) - fn {x) + /„ (x) ) dx- {f{x) - fn {x) + /„ {x) ) dx 

J a 

rb rb rb rb 

</ {f{x)-fn{x))dx+ fn{x)dx- [f [x) - fn{x)) dx - j fn{x)dx 
J a Ja J a J a 

rb rb rb rb 

= / {f{x)-fn{x))dx+ f„{x)dx- {f{x)~ fn{x)) dx- I fn{x)dx 
Ja Ja Ja Ja 

rb rb 

' {f{x)-fn{x))dx- {f{x)-fn{x))dx 
a J a 



£ £ 

< — r{b-a) + — rib -a) = £. 



2{b-a) 



2{b-a) 



The first inequality is Exercise 5.2.16 (it follows as supremum of a sum is less than or equal to 
the sum of suprema and similarly for infima, see Exercise 1.3.7). The second inequality follows 
from Proposition 5.1.8 and the fact that for all x G [a,b\ we have 2{b-a) ^ ■^('^) ~/"(-^) < 2{b~a) • 
As £ > U was arbitrary, / is Riemann integrable. 

Finally we compute We apply Proposition 5.1.10 in the calculation. Again, for n> M 
(where M is the same as above) we have 



{fix)- f nix)) dx 



rb rb 






/ /- / fn 




J 


J a J a 




J a 



Therefore {/^, /„} converges to /. 
Example 6.2.5: Suppose we wish to compute 

'1 nx + s'minx^) 



□ 



lim 

n^°° JQ 



dx. 



It is impossible to compute the integrals for any particular n using calculus as sin(nx^) has no closed- 
form antiderivative. However, we can compute the limit. We have shown before that 
converges uniformly on [0, 1] to x. By Theorem 6.2.4, the limit exists and 



nx+sin(nx 



lim 

n^oo jQ 



nx + s'minx^) 



dx= / X dx= 1/2. 
Jo 
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Example 6.2.6: If convergence is only pointwise, the limit need not even be Riemann integrable. 
On [0, 1] define 

1 if X = p/q in lowest terms and q < n, 
otherwise. 



The function /„ differs from the zero function at finitely many points; there are only finitely many 
fractions in [0, 1] with denominator less than or equal to n. So /„ is integrable and /J /„ = /q = 0. 
It is an easy exercise to show that /„ converges pointwise to the Dirichlet function 



which is not Riemann integrable. 



1 ifxeQ, 
otherwise, 



6.2.3 Exercises 

Exercise 6.2.1; While uniform convergence can preserve continuity, it does not preserve differentiability. Find 
an explicit example of a sequence of differentiable functions on [—1,1] that converge uniformly to a function 
f such that f is not differentiable. Hint: Consider |x|'^^^", show that these functions are differentiable, 
converge uniformly, and then show that the limit is not differentiable. 

Exercise 6.2.2: Let fn{x) = — . Show that fn converges uniformly to a differentiable function f on [0, 1] (find 
f). However, show that f'{l) / lim/,^(l). 

Note: The previous two exercises show that we cannot simply swap limits with derivatives, even if the 
convergence is uniform. See also Exercise 6.2.7 below. 

fl fix) 

Exercise 6.2.2): Letf: I] be a Riemann integrable (hence bounded) function. Find \\m I dx. 

"^°°.Jq n 

Exercise 6.2.4: Show lim / e dx = 0. Feel free to use what you know about the exponential function 

«— J \ 

from calculus. 

Exercise 6.2.5: Find an example of a sequence of continuous functions on (0, 1) that converges pointwise to 
a continuous function on (0, 1), but the convergence is not uniform. 

Note: In the previous exercise, (0, 1) was picked for simplicity. For a more challenging exercise, replace 
(0,1) with [0,1]. 

Exercise 6.2.6: True/False; prove or find a counterexample to the following statement: If {fx} is a sequence 
of everywhere discontinuous functions on [0, 1] that converge uniformly to a function f, then f is everywhere 
discontinuous. 
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Exercise 6.2.7; For a continuously dijferentiable function f: [a,b] — ?• M, define 

ll/llc :=II/L + II/1L,- 

Suppose that {/„} is a sequence of continuously dijferentiable functions such that for every £ > 0, there exists 
an M such that for all n,k > M we have 

Wfn-fkWc' <£. 

Show that {/„} converges uniformly to some continuously differentiable function f \ [a,b] — t- M. 

For the following two exercises let us define for a Riemann integrable function / : [0, 1] — M the following 
number 

11/11^1 := f\f{x)\dx. 
Jo 

It is true that |/| is integrable whenever / is, see Exercise 5.2.15. This norm defines another very common 
type of convergence called the -convergence, that is however a bit more subtle. 

Exercise 6.2.8: Suppose that {/„} is a sequence of Riemann integrable functions on [0, 1] that converges 
uniformly to 0. Show that 

Exercise 6.2.9: Find a sequence of Riemann integrable functions {/„} on [0, 1] that converges pointwise to 
0, but 

Um ll/n II r 1 does not exist ( is 

Exercise 6.2.10 (Hard): Prove Dini's theorem.- Let fn : [a,b] -^M be a sequence of continuous functions 
such that 

< fn+i [x) < < • • • < /i (x) for all « G N. 
Suppose that {/„} converges pointwise to 0. Show that {fn} converges to zero uniformly. 

Exercise 6.2.11: Suppose that fn '■ [a,b\ — )• M a sequence of continuous functions that converges pointwise 
to a continuous f: [a,b] — >• M. Suppose that for any x E [a,b] the sequence {|/«(x) — f{x)\} is monotone. 
Show that the sequence {/„} converges uniformly. 

Exercise 6.2.12: Find a sequence of Riemann integrable functions fn'. — t- M such that fn converges to 
zero pointwise, and such that a) {/o/«}„_i increases without bound, b) {/o^/n}„_i the sequence 
-1,1,-1,1,-1,1,.... 
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6.3 Picard's theorem 

Note: 1-2 lectures (can be safely skipped) 

A course such as this one should have a piece de resistance caliber theorem. We pick a theorem 
whose proof combines everything we have learned. It is more sophisticated than the fundamental 
theorem of calculus, the first highlight theorem of this course. The theorem we are talking about is 
Picard's theorem* on existence and uniqueness of a solution to an ordinary differential equation. 
Both the statement and the proof are beautiful examples of what one can do with all that we 
have learned. It is also a good example of how analysis is applied as differential equations are 
indispensable in science. 

6.3.1 First order ordinary differential equation 

Modern science is described in the language of differential equations. That is equations that involve 
not only the unknown, but also its derivatives. The simplest nontrivial form of a differential equation 
is the so-called first order ordinary differential equation 

y' = F{x,y). 

Generally we also specify that y{xQ) =yQ. The solution of the equation is a function y{x) such that 
y{xQ)=yQ and y'{x) =F{x,y{x)). 

When F involves only the x variable, the solution is given by the fundamental theorem of 
calculus. On the other hand, when F depends on both x and y we need far more firepower. It is not 
always true that a solution exists, and if it does, that it is the unique solution. Picard's theorem gives 
us certain sufficient conditions for existence and uniqueness. 

6.3.2 The theorem 

We need a definition of continuity in two variables. First, a point in the plane = M x M is denoted 
by an ordered pair (x, y) . To make matters simple, let us give the following sequential definition of 
continuity. 

Definition 6.3.1. Let i7 c be a set and F: U — )■ M be a function. Let {x,y) G t/ be a point. 
The function F is continuous at {x,y) if for every sequence {(■'Cn, Jn)},7=i of points in U such that 
hm Xn=x and lim };„ = y, we have that 

Y\mF{xn,yn) =F{x,y). 

We say F is continuous if it is continuous at all points in U. 

*Named for the French mathematician Charles Emile Picard (1856-1941). 
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Theorem 6.3.2 (Picard's theorem on existence and uniqueness). Let I, J cM.be closed bounded 
intervals, let Iq and Jq be their interiors, and let (xo,jo) ^ x -^o- Suppose F : I x J ^ M. is 
continuous and Lipschitz in the second variable, that is, there exists a number L such that 

\F{x,y) —F{x,z) \ <L\y — z\ for ally, J, xEl. 

Then there exists an h> and a unique dijferentiable function f: [xq — h,XQ + h] ^ J C M., such 
that 

f'{x) =F{xJ{x)) and f{xo)=yo. (6.1) 

Proof. Suppose that we could find a solution /. By the fundamental theorem of calculus we can 
integrate the equation f'{x) = F{x,f{x)), f{xQ) = yo, and write (6.1) as the integral equation 

f{x)=yo+ rF{tJ{t))dt. (6.2) 

The idea of our proof is that we try to plug in approximations to a solution to the right-hand side 
of (6.2) to get better approximations on the left hand side of (6.2) . We hope that in the end the 
sequence converges and solves (6.2) and hence (6.1) . The technique below is called Picard iteration, 
and the individual functions are called the Picard iterates. 

Without loss of generality, suppose that xq = (exercise below). Another exercise tells us that 
F is bounded as it is continuous. Let M := sup{|F(x, j) | : (x,y) G / x J}. Without loss of generality, 
we can assume M > (why?). Pick a > such that [—a, a] C / and [yo — oc,yQ-\- a] C J . Define 

:= min < a, — I . (6.3) 
\ M + La J 

Now note that [-h, h] C /. 

Set fo{x) := yo. We define fk inductively. Assuming that A-i ([— ^, h]) C [yo — a,yo -f a], we 
see that ^ well defined function of f for f G [—h,h\. Further if is continuous on 

[—h,h], then F(?,/yt-i(?)) is continuous as a function of t on [—h,h] (left as an exercise). Define 



AW ■=yo+ F{tjk-i{t)) dt, 



and A is continuous on [—h, h] by the fundamental theorem of calculus. To see that A maps [—h, h] 
to [yo — , Jo + Cf ] . we compute for x G [—h, h] 



IAW-3'ol 



F[tJk-i{t)) dt 



<M\x\<Mh< M — - — < a. 



We now define A+i ^nd so on, and we have defined a sequence [fk] of functions. We need to show 
that it converges to a function / that solves the equation (6.2) and therefore (6.1). 
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We wish to show that the sequence {/^} converges uniformly to some function on [—h,h]. First, 
for t E [—h,h] we have the following useful bound 

\F{tJ„{t))-F{tJkit))\ <L\f„{t)-fkit)\<L\\f„-fkt, 

where \\f„ — fk\\u is the uniform norm, that is the supremum of \fn{t) — ^ ^ [—h^h]. Now 

note that \x\ < h< . , ^ . Therefore 



M+La- 



I^F{t,fn-i{t)) dt-j^F{t,fk-i{t)) dt 

l^F{t,fn-i{t))-F{t,fk^l{t))dt 



LetC 



<L||/„_i-/fc_i||Jx| 
and note that C < 1 . Taking supremum on the left-hand side we get 

<C||/„_l -fk-l\\u- 



M+La 



Without loss of generality, suppose that n>k. Then by induction we can show that 

Wfn-fkWu < C'Wfn^k- foWu- 

For X G [— /z, h] we have 



\fn-k{x) - fo{x) \ = \ fn-k{x) -yo\ < «. 



Therefore 



||/.-MI„<C^||/„-^-/o||„<C^a. 
As C < 1, {/„} is uniformly Cauchy and by Proposition 6.1.13 we obtain that {/„} converges 
uniformly on [—h,h\ to some function /: [—h,h\ — )• M. The function / is the uniform limit of 
continuous functions and therefore continuous. 

We now need to show that / solves (6.2) . First, as before we notice 

\F{tJ„{t))-F{tJit))\ <L\Mt)-fit)\<L\\fn-f\l. 

As IIA — converges to 0, then F (?,/„(?)) converges uniformly to F (t,f{t)) where t G [—h,h]. 
Then the convergence is then uniform (why?) on [0,x] (or [x,0] if x < 0) if G [—h^h]. Therefore, 



3^0+ / F{tJ{t)) dt=yo+ / F{t, lim/„(0) dt 

Jo ri^°° 

= yQ+ ^^T^ F (t , fn{t)) dt (by continuity of F) 



lim {yo+ FitJnit)) dt 
lim/„+i(x) =f{x). 



(by uniform convergence) 
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We apply the fundamental theorem of calculus to show that / is differentiable and its derivative is 

F (x, f{x)) . It is obvious that /(O) = yo. 

Finally, what is left to do is to show uniqueness. Suppose g: [—h,h\ — )■ M is another solution. 
As before we use the fact that \F{tJ{t)) -F{t,g{t)) \ <L\\f-g\\^^. Then 

X 



yo + J^F{t,f{t)) dt- (^yo + jy{t,g{t)) dt 
F{t,f{t))-F{t,g{t))dt 



La 

<L\\f-8\\u\^\<Lh\\f-g\l<^^^^\\f-g\l. 
As before, C = j^i^ < 1. By taking supremum over G [—h,h] on the left hand side we obtain 

||/-glL<c||/-g||„. 

This is only possible if || / — = 0. Therefore, f — g, and the solution is unique. □ 

6.3.3 Examples 

Let us look at some examples. The proof of the theorem gives us an explicit way to find an h that 
works. It does not, however, give use the best h. It is often possible to find a much larger h for 
which the conclusion of the theorem holds. 

The proof also gives us the Picard iterates as approximations to the solution. So the proof 
actually tells us how to obtain the solution, not just that the solution exists. 

Example 6.3.3: Consider 

fix) = fix), /(0) = 1. 

That is, we let F(;c,j) = y, and we are looking for a function / such that fix) — fix). We pick any 
/ that contains in the interior. We pick an arbitrary / that contains 1 in its interior. We can use 
L = 1. The theorem guarantees anh>0 such that there exists a unique solution / : [—h, h] — > R. 
This solution is usually denoted by 

e^:=fix). 

We leave it to the reader to verify that by picking / and J large enough the proof of the theorem 
guarantees that we are able to pick a such that we get any h we want as long as h< 1/2. We omit 
the calculation. 

Of course, we know that this function exists as a function for all x, so an arbitrary h ought 
to work. By same reasoning as above, we can see that no matter what xq and yo are, the proof 
guarantees an arbitrary h as long as < 1/2. Fix such an h. We get a unique function defined on 
[xo — h,xo + h]. After defining the function on [—h,h] we find a solution on the interval [0, 2h] and 
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notice that the two functions must coincide on [0,/?] by uniqueness. We can thus iteratively construct 
the exponential for all x G M. Therefore Picard's theorem could be used to prove the existence and 
uniqueness of the exponential. 

Let us compute the Picard iterates. We start with the constant function fQ{x) := 1. Then 

/iw=i + r h{s) ds=\+x, 

Jo 

/2W = 1+ r/lis) ds=l+ Hl+S) ds=l+X + '^, 
JO JO 2 

h{x) = \ + f2{s)ds = \ + ( ) ds=l+x+- + -. 

We recognize the beginning of the Taylor series for the exponential. 
Example 6.3.4: Suppose we have the equation 

f'{x) = {f{x)f and /(O) = 1. 
From elementary differential equations we know that 



\-x 

is the solution. Do note that the solution is only defined on (—0°, 1). That is, we are able to use 
h<l, but never a larger h. Note that the function that takes y to is not Lipschitz as a function on 
all of M. As we approach x=l from the left, the solution becomes larger and larger. The derivative 
of the solution grows as j^, and therefore the L required will have to be larger and larger as 
grows. Thus if we apply the theorem with xq close to 1 and jq = we find that the h that the 
proof guarantees will be smaller and smaller as xq approaches 1 . 

By picking a correctly, the proof of the theorem guarantees h = \ — V^/i ^ 0. 134 (we omit the 
calculation) for xq = and yQ = l, even though we see from above that any h < \ should work. 

Example 6.3.5: Suppose we start with the equation 

f'{x)=2^\f{x)l /(0)=0. 

Note that F{x,y) = 2^J\y\ is not Lipschitz in y (why?). Therefore the equation does not satisfy the 
hypotheses of the theorem. The function 

' x^ ifjc>0, 
-x^ if jc < 0, 



is a solution, but g{x) = is also a solution. So a solution is not unique. 

Example 6.3.6: Consider y' = (p{x) where (p{x) := if jc G Q and (p{x) := \ if x ^ Q. The equation 
has no solution regardless of the initial conditions. A solution would have derivative (p, but cp does 
not have the intermediate value property at any point (why?). So no solution can exist by Darboux's 
theorem. Therefore to obtain existence of a solution, some continuity hypothesis on F is necessary. 
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6.3.4 Exercises 

Exercise 6.3.1; Let I, J cM.be intervals. Let F : I x J ^M.he a continuous function of two variables and 
suppose that f\ I ^ J be a continuous function. Show that F (x, f{xj) is a continuous function on L 

Exercise 6.3.2: Let I,J G^be closed bounded intervals. Show that ifF : / x 7 — > M « continuous, then F is 
bounded. 

Exercise 6.3.3; We have proved Picard's theorem under the assumption that xo = 0. Prove the full statement 
ofPicard's theorem for an arbitrary xq. 

Exercise 6.3.4; Let f'{x) = xf(x) be our equation. Start with the initial condition f(0) = 2 and find the 
Picard iterates /o , /i , /2 , /a , A- 

Exercise 6.3.5; Suppose that F: I xJ —^Wis a function that is continuous in the first variable, that is, for 
any fixed y the function that takes x to F{x,y) is continuous. Further, suppose that F is Lipschitz in the second 
variable, that is, there exists a number L such that 

\F {x,y) — F {x,z) \ <L\y — z\ for ally, z EJ,xeL 

Show that F is continuous as a function of two variables. Therefore, the hypotheses in the theorem could be 
made even weaker 

Exercise 6.3.6; A common type of equation one encounters are linear first order differential equations, that 
is equations of the form 

y' + p{x)y = q{x) , y{xo) = yo- 

Prove Picard's theorem for linear equations. Suppose that I is an interval, xq G /, and p: I —^M. and ^ : / — > M 
are continuous. Show that there exists a unique dijferentiable f: / — > M, such that y = f{x) satisfies the 
equation and the initial condition. Hint: Assume existence of the exponential function and use the integrating 
factor formula for existence of f (prove that it works): 

fix) := g-Z^^W'^^ (^£/oPW%(?) dt+yo^ . 
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7.1 Metric spaces 

Note: 1.5 lectures 

As mentioned in the introduction, the main idea in analysis is to take limits. In chapter 2 we 
learned to take limits of sequences of real numbers. And in chapter 3 we learned to take limits of 
functions as a real number approached some other real number. 

We want to take limits in more complicated contexts. For example, we want to have sequences 
of points in 3-dimensional space. We wish to define continuous functions of several variables. We 
even want to define functions on spaces that are a little harder to describe, such as the surface of the 
earth. We still want to talk about limits there. 

Finally, we have seen the limit of a sequence of functions in chapter 6. We wish to unify all these 
notions so that we do not have to reprove theorems over and over again in each context. The concept 
of a metric space is an elementary yet powerful tool in analysis. And while it is not sufficient to 
describe every type of limit we find in modern analysis, it gets us very far indeed. 

Definition 7.1.1. Let X be a set, and let J : X x X — )► M be a function such that 

(i) d{x,y) > for all x,y in X, 

(ii) d{x^y) = if and only ifx = y, 

(iii) d{x,y) = d{y,x), 

(iv) d{x,z) < d{x,y) + d{y,z) {triangle inequality). 

Then the pair {X,d) is called a metric space. The function d is called the metric or sometimes the 
distance function. Sometimes we just say Z is a metric space if the metric is clear from context. 

The geometric idea is that d is the distance between two points. Items (i)-(iii) have obvious 
geometric interpretation: distance is always nonnegative, the only point that is distance away from 
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z 




Figure 7.1: Diagram of the triangle inequality in metric spaces. 



x\%x itself, and finally that the distance from .jc to j is the same as the distance from y to x. The 
triangle inequality (iv) has the interpretation given in Figure 7.1. 

For the purposes of drawing, it is convenient to draw figures and diagrams in the plane and have 
the metric be the standard distance. However, that is only one particular metric space. Just because 
a certain fact seems to be clear from drawing a picture does not mean it is true. You might be getting 
sidetracked by intuition from euclidean geometry, whereas the concept of a metric space is a lot 
more general. 

Let us give some examples of metric spaces. 

Example 7.1.2: The set of real numbers M is a metric space with the metric 

d{x,y) := \x — y\ . 

Items (i)-(iii) of the definition are easy to verify. The triangle inequality (iv) follows immediately 
from the standard triangle inequality for real numbers: 

d{x,z) = \x-z\ = \x-y + y-z\ < \x-y\ + \y-z\ = d{x,y)+d{y,z). 

This metric is the standard metric on M. If we talk about M as a metric space without mentioning a 
specific metric, we mean this particular metric. 

Example 7.1.3: We can also put a different metric on the set of real numbers. For example, take 
the set of real numbers M together with the metric 





X — 


y\ 




\x 


-y\ 


+ 1 



Items (i)-(iii) are again easy to verify. The triangle inequality (iv) is a little bit more difficult. 
Note that d{x,y) = (p{\x — y\) where (p{t) = ^ and note that (p is an increasing function (positive 
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derivative). Hence 



d{x,z) = (pi\x-z\) = (pi\x-y + y-z\) < (pi\x-y\ + \y-z\) 

\y-z\ 





X — 


y\ + \y- 






\x — y\ 




X 


-y\ + \y-z\ 


+ 1 1 


x-y\ + \y- 


-z\ 


+ 1 



+ 





X — 


y\ 




\x 


-y\ 


+ 1 



j=d{x,y)+d{y,z). 

Here we have an example of a nonstandard metric on R. With this metric we see for example that 
d{x,y) < 1 for all x,y e R. That is, any two points are less than 1 unit apart. 

An important metric space is the n-dimensional euclidean space R" = R x R x • • • x R. We use 
the following notation for points: x= (j:i,X2, . . . e R". We also simply write e R" to mean 
the vector (0, 0, . . . , 0). Before making R" a metric space, let us prove an important inequality, the 
so-called Cauchy-Schwarz inequality. 

Lemma 7.1.4 (Cauchy-Schwarz inequaUty). Takex= (xi,X2,...,Xn) andy= {yi,y2.,....,yn) £ 
R«. Then 

2 / n \ / n 

2 



7=1 / V=i / \y=i 
Proof. Any square of a real number is nonnegative. Hence any sum of squares is nonnegative: 



2 



0< £ £ (xjyk-xkyj) 

j=U=l 
n n 

= E E i^jyl +4yj - ^Fkyjyk) 



7=1 / \k=\ / \;=1 / \k=] / \i=l / \k=i 



\j=l / \k=l / \;=1 / \k=l / V=l / U=l 

We relabel and divide by 2 to obtain 

which is precisely what we wanted. □ 
Example 7.1.5: Let us construct standard metric for R". Define 

dix.y) := ^ (xi -yif+ix2-y2f + ■■■ + (xn-ynf = ^Y^{xj-yjf. 
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For n = 1, the real line, this metric agrees with what we did above. Again, the only tricky part of the 
definition to check is the triangle inequahty. It is less messy to work with the square of the metric. 
In the following, note the use of the Cauchy-Schwarz inequality. 

n 

d{,x,zf= £(^j-Zj)^ 

n 

7=1 

n 

= E ( (-^7 - yjf + (yj - + 2(xj - jj) [jj - zj, 

n n 11 

= E i^j - yjf + E iyj - ^jf + E - yj) (yj - 

;=i 7=1 7=1 



< E (^7-3^7-)'+ E {yj-Zj)' + 2j E (^7-3^7)' E (3^7-^7-)' 
7=1 J=l V i=l 

2 



E(^7-3'7) +yE (3^7-27) 



(J(.)c,y)+J(y,z)) . 



Taking the square root of both sides we obtain the correct inequality. 

Example 7.1.6: An example to keep in mind is the so-called discrete metric. Let X be any set and 
define 

1 if X ^ J, 



d{x,y) :-- 



\ix = y. 



That is, all points are equally distant from each other. When X is a finite set, we can draw a diagram, 
see for example Figure 7.2. Things become subtle when X is an infinite set such as the real numbers. 




Figure 7.2: Sample discrete metric space {a, b, c, d, e}, the distance between any two points is 1. 
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While this particular example seldom comes up in practice, it gives a useful "smell test." If you 
make a statement about metric spaces, try it with the discrete metric. To show that (X, d) is indeed 
a metric space is left as an exercise. 

Example 7.1.7: Let C([a,Z7],]R) be the set of continuous real-valued functions on the interval [a,b]. 
Define the metric on C( [a,b], M) as 

d{f,g) ■■= sup \f{x) -g{x)\. 

x(E[a,b\ 

Let us check the properties. First, d{f,g) is finite as \f{x) — g{x) \ is a continuous function on a 
closed bounded interval [a, b], and so is bounded. It is clear that d{f,g) > 0, it is the supremum of 
nonnegative numbers. If / = ^ then \f{x) — g{x) \ =0 for all x and hence d{f, g) = 0. Conversely if 
d{f, g) = 0, then for any x we have \f{x) — g{x) \ < d{f,g) = and hence f{x) = g{x) for all x and 
f = g. That d{f,g) = d{g,f) is equally trivial. To show the triangle inequality we use the standard 
triangle inequality. 

d{f,h)= sup \f{x)-g{x)\= sup \f{x)-h{x)+h{x)-g{x)\ 

x(E[a,b] x(i[a,b\ 

< sup {\f{x)-h{x)\ + \h{x)-g{x)\) 

< sup \f{x)-h{x)\+ sup \h{x)-g{x)\=d{f,h)+d{h,g). 

x^[a.b] x(li[a,b] 

When treating C( [a, b] , M) as a metric space without mentioning a metric, we mean this particular 
metric. Notice that d{f,g) = ||/ — the uniform norm of Definition 6.1.9. 

This example may seem esoteric at first, but it turns out that working with spaces such as 
C( [a,b], M) is really the meat of a large part of modern analysis. Treating sets of functions as metric 
spaces allows us to abstract away a lot of the grubby detail and prove powerful results such as 
Picard's theorem with less work. 

Oftentimes it is useful to consider a subset of a larger metric space as a metric space itself. We 
obtain the following proposition, which has a trivial proof. 

Proposition 7.1.8. Let {X,d) be a metric space and Y dX, then the restriction J|yxK a metric 
on Y. 

Definition 7.1.9. If {X,d) is a metric space, Y CX, and d' := d\YxY^ then {Y,d') is said to be a 
subspace of (X, J). 

It is common to simply write d for the metric on Y, as it is the restriction of the metric on X. 
Sometimes we say that d' is the subspace metric and that Y has the subspace topology. 

A subset of the real numbers is bounded whenever all its elements are at most some fixed 
distance from 0. We also define bounded sets in a metric space. When dealing with an arbitrary 
metric space there may not be some natural fixed point 0. For the purposes of boundedness it does 
not matter. 
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Definition 7.1.10. Let {X,d) be a metric space. A subset S CX is said to be bounded if there exists 
a p G X and a 5 G M such that 

d{p,x) < B for all x eS. 
We say that (X, d) is bounded if X itself is a bounded subset. 

For example, the set of real numbers with the standard metric is not a bounded metric space. It 
is not hard to see that a subset of the real numbers is bounded in the sense of chapter 1 if and only if 
it is bounded as a subset of the metric space of real numbers with the standard metric. 

On the other hand, if we take the real numbers with the discrete metric, then we obtain a bounded 
metric space. In fact, any set with the discrete metric is bounded. 

7.1.1 Exercises 

Exercise 7.1.1; Show that for any set X, the discrete metric (d{x,y) = 1 ifx ^ y and d{x,x) =0) does give a 
metric space {X,d). 

Exercise 7.1.2; Let X := {0} be a set. Can you make it into a metric space? 

Exercise 7.1.3; LetX : = {a,b} be a set. Can you make it into two distinct metric spaces? (define two distinct 
metrics on it) 

Exercise 7.1.4: Let the set X := {A,B,C} represent 3 buildings on campus. Suppose we wish our distance to 
be the time it takes to walk from one building to the other It takes 5 minutes either way between buildings A 
and B. However, building C is on a hill and it takes 10 minutes from A and 15 minutes from B to get to C. On 
the other hand it takes 5 minutes to go from C to A and 7 minutes to go from C to B, as we are going downhill. 
Do these distances define a metric? If so, prove it, if not, say why not. 

Exercise 7.1.5; Suppose that {X,d) is a metric space and (p: [0,°°] — >• M is an increasing function such 
that (p{t) > Q for all t and (p[t) = if and only if t = 0. Also suppose that (p is subadditive, that is 
(p{s + t) < (p{s) + (p{t). Show that with d'{x,y) := (p(^d{x,y)y we obtain a new metric space {X,d'). 

Exercise 7.1.6; Let (X,dx) and (F,<iy) be metric spaces. 

a) Show that (X x Y,d) with d[{xi,yi), {x2,y2)) '■= dx{xi,X2) + d¥{yi,y2) i^ <^ metric space. 

b) Show that (X xY,d) with d(^{xi,yi),{x2,y2)) '■= vtia'>i-{dx{x[,X2),dY(yi,y2)} is a metric space. 

Exercise 7.1.7; Let X be the set of continuous functions on [0, 1]. Let (p: [0, 1] — t- (0,oo) be continuous. 
Define 




Show that {X,d) is a metric space. 
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Exercise 7.1.8: Let {X,d) be a metric space. For nonempty bounded subsets A and B let 

d{x,B):=mf{d{x,b):beB} and d{A,B) := sup{d{a,B) : a eA}. 
Now define the Hausdorff metric as 

dH(A,B) :=max{d{A,B),d{B,A)}. 

Note: dn can be defined for arbitrary nonempty subsets if we allow the extended reals. 

a) Let Y C ^{X) be the set of bounded nonempty subsets. Show that {Y,dH) is a metric space, b) Show by 

example that d itself is not a metric. That is, d is not always symmetric. 
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7.2 Open and closed sets 

Note: 2 lectures 

7.2.1 Topology 

It is useful to define a so-called topology. That is we define closed and open sets in a metric space. 
Before doing so, let us define two special sets. 

Definition 7.2.1. Let {X, d) be a metric space, xEX and 5 > 0. Then define the open ball or simply 
ball of radius 5 around x as 

B{x,8) := {yeX:d{x,y) < d}. 
Similarly we define the closed ball as 

C{x,5) ■.= {yeX:d{x,y) <5}. 

When we are dealing with different metric spaces, it is sometimes convenient to emphasize 
which metric space the ball is in. We do this by writing Bxix, 5) := B{x, 5) or Cx{x, 5) := C{x, 5). 

Example 7.2.2: Take the metric space M with the standard metric. For x G M, and 5 > we get 

B{x,5) = {x— 5,x-\- 5) and C{x,5) = [x — 5,x-\-5]. 

Example 7.2.3: Be careful when working on a subspace. Suppose we take the metric space [0, 1] 
as a subspace of M. Then in [0, 1] we get 

5(0,1/2) =5[o,i](0,l/2) = [0,1/2). 

This is of course different from 5k(0, 1/2) = (— 1/2, 1/2). The important thing to keep in mind is 
which metric space we are working in. 

Definition 7.2.4. Let (X, d) be a metric space. A set V C X is open if for every xEV , there exists 
a 5 > such that B{x, d) C V . See Figure 7.3. A set £ C Z is closed if the complement =X\E 
is open. When the ambient space X is not clear from context we say V is open in X and E is closed 
inX. 

If X G y and V is open, then we say that V is an open neighborhood of x (or sometimes just 
neighborhood). 

Intuitively, an open set is a set that does not include its "boundary." Note that not every set is 
either open or closed, in fact generally most subsets are neither. 

Example 7.2.5: The set [0, 1) C M is neither open nor closed. First, every ball in M around 0, 
(—5, 5) contains negative numbers and hence is not contained in [0, 1) and so [0, 1) is not open. 
Second, every ball in M around 1,(1 — 5,1 + 5) contains numbers strictly less than 1 and greater 
than (e.g. 1 — ^/i as long as 5 < 2). Thus M \ [0, 1) is not open, and so [0, 1) is not closed. 
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Proposition 7.2.6. Let {X,d) be a metric space. 

( i) and X are open in X. 

( ii) If Vi,V2, ■■■ ,Vk are open then 

k 

is also open. That is, finite intersection of open sets is open. 
(Hi) If{Vx}i^j is an arbitrary collection of open sets, then 

is also open. That is, union of open sets is open. 

Note that the index set in (iii) is arbitrarily large. By Uag/^A we simply mean the set of all x 
such that X eVi for at least one A G /. 

Proof. The sets X and are obviously open in X. 

Let us prove (ii). If x G fl/^i Vj, then x G Vj for all j. As Vj are all open, there exists a dj > 
for every j such that B{x, 5j) C Vj. Take 5 := min{5i, . . . , 5^} and note that 5 > 0. We have 
B{x, d) C B{x, 5j) C Vj for every j and thus B{x, 5) C 0^=1 X/- Thus the intersection is open. 

Let us prove (iii). If x G Uag/^A' then x eVx for some A G /. As V^ is open then there exists a 
5 > such that B{x, 5) CV^. But then B{x, d) C Uag/X^ so the union is open. □ 

Example 7.2.7: The main thing to notice is the difference between items (ii) and (iii). Item (ii) is 
not true for an arbitrary intersection, for example nr=i(~V"5 V") = {0}^ which is not open. 

The proof of the following analogous proposition for closed sets is left as an exercise. 
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Proposition 7.2.8. Let {X,d) be a metric space. 

( i) and X are closed in X. 

(ii) If{Ex}xei arbitrary collection of closed sets, then 

is also closed. That is, intersection of closed sets is closed. 
( Hi) IfEi,E2,...,Ej^ are closed then 

k 

is also closed. That is, finite union of closed sets is closed. 

We have not yet shown that the open ball is open and the closed ball is closed. Let us show this 
fact now to justify the terminology. 

Proposition 7.2.9. Let {X,d) be a metric space, x&X, and 5 > 0. Then B{x, 5) is open and C{x, 5) 
is closed. 

Proof. Let y e B{x, 5). Let a := 5 - d{x,y). Of course a > 0. Now let z e B{y, a). Then 

d{x,z) < d{x,y) +d{y,z) < d{x,y) + oc = d{x,y) + 5 - d{x,y) = 5. 

Therefore z G B{x, 5) for every z G B{y, a). So B{y, a) C B(x, 5) and B{x, 5) is open. 

The proof that C(jc, 5) is closed is left as an exercise. □ 

Again be careful about what is the ambient metric space. As [0, 1/2) is an open ball in [0, 1], this 
means that [0, 1/2) is an open set in [0, 1]. On the other hand [0, 1/2) is neither open nor closed in M. 

A useful way to think about an open set is a union of open balls. If U is open, then for each 
X G i/, there is a > (depending on jc of course) such that 5 (jc, 5^) CU. Thent/ — \Jj^^ijB{x,dx). 

The proof of the following proposition is left as an exercise. Note that there are other open and 
closed sets in R. 

Proposition 7.2.10. Let a <b be two real numbers. Then {a, b), {a, 0°), and (— °o, b) are open in R. 
Also [a, b], [fl,o°), and (—<»,&] are closed in R. 

7.2.2 Connected sets 

Definition 7.2.11. A nonempty metric space {X,d) is connected if the only subsets that are both 
open and closed are and X itself. 

When we apply the term connected to a nonempty subset A C Z, we simply mean that A with 
the subspace topology is connected. 
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In other words, a nonempty X is connected if whenever we write X =XiUX2 where Xi 0X2 = 
and Xi and X2 are open, then either Xi = or X2 = 0. So to test for disconnectedness, we need to 
find nonempty disjoint open sets Xi and X2 whose union is X. For subsets, we state this idea as a 
proposition. 

Proposition 7.2.12. Let {X,d) be a metric space. A nonempty set S CX is not connected if and 
only if there exist open sets Ui and U2 in X, such that i7i fl [/2 H 5 = 0, t/i fl S 7^ 0, f/2 H 5 7^ 0, and 

S= ([/in5)u([/2n5). 

Proof. If Uj is open in X, then Uj (IS is open in S in the subspace topology (with subspace metric). 
To see this, note that if Bx{x, 5) C Uj, then as Bs{x, 5) = Sr\Bx{x, 5), we have ^^(jc, 5) C UjCiS. 
The proof follows by the above discussion. 

The proof of the other direction follows by using Exercise 7.2.12 to find U{ and U2 from two 
open disjoint subsets of S. □ 

Example 7.2.13: Let 5 C M be such that x< z<y with x,y eS and z^S. Claim: S is not connected. 
Proof: Notice 

((-oo,z)n5)u((z,oo)n5) =5. (7.1) 

Proposition 7.2.14. A set S cM^is connected if and only if it is an interval or a single point. 

Proof. Suppose that S is connected (so also nonempty). If 5 is a single point then we are done. 
So suppose that x <y and x,y E S. If z is such that x < z < y, then (— °o,z) fl 5 is nonempty and 
(z, 00) n 5 is nonempty. The two sets are disjoint. As S is connected, we must have they their union 
is not 5, so z G S. 

Suppose that S is bounded, connected, but not a single point. Let a := inf 5 and /3 := sup S 
and note that a < /3. Suppose a < z < p. As a is the infimum, then there is an x G 5 such that 
a < X < z. Similarly there is a y G 5 such that P >y > z. We have shown above that z G 5, so 
(a, /3) C S. If w < a, then w ^ 5 as a was the infimum, similarly if w > /3 then w ^ S. Therefore 
the only possibilities for S are (a,j8), [o(,j8), (o(,/3], [a,/3]. 

The proof that an unbounded connected S is an interval is left as an exercise. 

On the other hand suppose that S is an interval. Suppose that Ui and U2 are open subsets of M, 
UiHS and t/2 n 5 are nonempty, and 5 = (t/i fl 5) U {U2 fl 5) . We will show that UiHS and t/2 n 5 
contain a common point, so they are not disjoint, and hence S must be connected. Suppose that 
there is x E UiDS and y EU2r\S. We can assume that x <y. As 5 is an interval [x,y] C S. Let 
z:=inf(t/2n[jc,j]). lfz = x, thenzG C/i. If z > x, then for any 5 > the ball 5 (z, 5) = {z-5,z+5) 
contains points that are not in U2, and so z ^ f/2 as U2 is open. Therefore, z EUi. As f/i is open, 
B(z, 5) C Ui for a small enough 5 > 0. As z is the infimum of U2 n [x,y], there must exist some 
wEU2n [x,y] such that w G [z,z + 5) C 5(z, 5) C Ui. Therefore w G C/i n i72 n [x,y]. So f/i n5 and 
[/2 n 5 are not disjoint and hence S is connected. □ 

Example 7.2.15: In many cases a ball B{x, 5) is connected. But this is not necessarily true in every 
metric space. For a simplest example, take a two point space {a,b} with the discrete metric. Then 
5(a,2) = {a,b}, which is not connected as 1) = {a} and B{b, 1) = {b} are open and disjoint. 
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7.2.3 Closure and boundary 

Sometime we wish to take a set and throw in everything that we can approach from the set. This 
concept is called the closure. 

Definition 7.2.16. Let (X, d) be a metric space and AcX. Then the closure of A is the set 

A := f]{E cX:Eis closed and AcE}. 
That is, A is the intersection of all closed sets that contain A. 

Proposition 7.2.17. Let {X,d) be a metric space and A C X. The closure A is closed. Furthermore 
if A is closed then A=A. 

Proof. First, the closure is the intersection of closed sets, so it is closed. Second, if A is closed, then 
take E =A, hence the intersection of all closed sets E containing A must be equal to A. □ 

Example 7.2.18: The closure of (0, 1) in R is [0, 1]. Proof: Simply notice that if E is closed and 
contains (0, 1), then E must contain and 1 (why?). Thus [0, 1] C E. But [0, 1] is also closed. 
Therefore the closure (0, 1) = [0, 1]. 

Example 7.2.19: Be careful to notice what ambient metric space you are working with. If X = 
(0,oo), then the closure of (0, 1) in (0,oo) is (0, 1]. Proof: Similarly as above (0, 1] is closed in (0,oo) 
(why?). Any closed set E that contains (0, 1) must contain 1 (why?). Therefore (0, 1] C E, and 
hence (0, 1) = (0, 1] when working in (0,oo). 

Let us justify the statement that the closure is everything that we can "approach" from the set. 

Proposition 7.2.20. Let {X,d) be a metric space and A C X. Then x e A if and only if for every 
d>0,B{x,5)nAj^(d. 

Proof. Let us prove the two contrapositives. Let us show that x ^ A if and only if there exists a 
d>0 such that B{x, d) HA = 0. 

First suppose that x ^A. We know A is closed. Thus there is a 5 > such that B(x,5) cA'' . As 
A c A we see that B{x, 5) C A'^ and hence B{x, 5) nA = 0. 

On the other hand suppose that there is a 5 > such that B{x, 5) n A = 0. Then 5(x, dy is a 
closed set and we have that A C B{x, 5^, but x ^ B{x, 5y. Thus as A is the intersection of closed 
sets containing A, we have x^A. □ 

We can also talk about what is in the interior of a set and what is on the boundary. 

Definition 7.2.21. Let {X,d) be a metric space and A c X, then the interior of A is the set 

A° -.^{xeA: there exists a 5 > such that B{x, 5) C A}. 
The boundary of A is the set 

dA:=A\A°. 
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Example 7.2.22: Suppose A = (0, 1] and X = M. Then it is not hard to see that A = [0, 1], A° = 

(0,l),and^A = {0,l}. 

Example 7.2.23: Suppose X = {a,b} with the discrete metric. Let A = {a}, then A = A° and 
(9A = 0. 

Proposition 7.2.24. Let {X,d) be a metric space and A C X. ThenA° is open and dA is closed. 

Proof. Given x eA° we have 5 > such that B(x, 5) C A. If z E B{x, 5), then as open balls are 
open, there is an e > such that e) c B{x, 5) C A, so z is in A°. Therefore 5) C A° and so 
A° is open. 

As A° is open, then (9A = A \ A° = A n (A°)'' is closed. □ 

The boundary is the set of points that are close to both the set and its complement. 

Proposition 7.2.25. Let {X,d) be a metric space and A C X. Then x E dA if and only if for every 
5 > 0, 5) flA and B{x,d) HA'-' are both nonempty. 

Proof. Suppose that x E dA = A\A° and let 5 > be arbitrary. By Proposition 7.2.20, B{x, 5) 
contains a point from A. If B{x, 5) contained no points of A'-', then x would be in A°. Hence B{x, d) 
contains a point of A^ as well. 

Let us prove the other direction by contrapositive. Ifx^A, then there is some 5 > such that 
B{x, 5) CA^ as A is closed. So B{x, 5) contains no points of A. 

Now suppose that x eA°, then there exists a 5 > such that B{x,5) C A, but that means that 
B{x, d) contains no points of A''. □ 

We obtain the following immediate corollary about closures of A and A^. We simply apply 
Proposition 7.2.20. 

Corollary 7.2.26. Let {X,d) be a metric space and A C X. Then dA= A flA*^'. 

7.2.4 Exercises 

Exercise 7.2.1; Prove Proposition 7.2.8. Hint: consider the complements of the sets and apply Proposi- 
tion 7.2.6. 

Exercise 7.2.2: Finish the proof of Proposition 7.2.9 by proving that C{x, d) is closed. 
Exercise 7.2.3: Prove Proposition 7.2.10. 

Exercise 7.2.4; Suppose that {X,d) is a nonempty metric space with the discrete topology. Show thatX is 
connected if and only if it contains exactly one element. 

Exercise 7.2.5; Show that ifS cM. is a connected unbounded set, then it is an ( unbounded) interval 
Exercise 7.2.6; Show that every open set can be written as a union of closed sets. 
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Exercise 7,2.7; a) Show that E is closed if and only if dE C £. b) Show that U is open if and only if 
dU^U = <b. 

Exercise 7.2.8: a) Show that A is open if and only ifA° =A.b) Suppose that U is an open set and U gA. 
Show that U CA°. 

Exercise 7.2.9; Let X be a set and d, d' be two metrics on X. Suppose that there exists an a > and j8 > 

such that ad(x,y) < d'(x,y) < pd{x,y) for all x,y G X. Show that U is open in {X,d) if and only ifU is open 
in {X,d'). That is, the topologies of(X,d) and {X,d') are the same. 

Exercise 7.2.10; Suppose that {Si}, i gN is a collection of connected subsets of a metric space {X,d). 
Suppose that there exists anx & X such that x G Si for all i G N. Show that U;li Si is connected. 

Exercise 7.2.11; Let Abe a connected set a) Is A connected? Prove or find a counterexample, b) Is A° 
connected? Prove or find a counterexample. Hint: Think of sets in M^. 

The definition of open sets in the following exercise is usually called the subspace topology. You are 
asked to show that we obtain the same topology by considering the subspace metric. 

Exercise 7.2.12; Suppose {X,d) is a metric space and Y cX. Show that with the subspace metric on Y, a 
setU C F is open (inY) whenever there exists an open setV GX such that U = V (lY. 

Exercise 7.2.13; Let (X,d) be a metric space, a) For any x EX and 5 > 0, show B{x, 5) C C{x, 5). b) Is it 
always true that B{x, 5) = C{x, 5)? Prove or find a counterexample. 

Exercise 7.2.14; Let {X,d) be a metric space and A C X. Show thatA° = \J{V -.V cAis open}. 



7.3. SEQUENCES AND CONVERGENCE 



221 



7.3 Sequences and convergence 

Note: 1 lecture 

7.3.1 Sequences 

The notion of a sequence in a metric space is very similar to a sequence of real numbers. 

Definition 7.3.1. A sequence in a metric space {X,d) is a function x: N — > Z. As before we write 
Xn for the nth element in the sequence and use the notation {jc„}, or more precisely 

{xn\n=\- 

A sequence {x„} is bounded if there exists a point p & X and 5 e R such that 

d(p,Xn)<B forallneN. 

In other words, the sequence is bounded whenever the set {xn : n G N} is bounded. 

If {n;}JLj is a sequence of natural numbers such that nj+i > nj for all j then the sequence 
{xnj}y^i is said to be a subsequence of 

Similarly we also define convergence. Again, we will be cheating a Httle bit and we will use the 
definite article in front of the word limit before we prove that the limit is unique. 

Definition 7.3.2. A sequence {x„} in a metric space {X,d) is said to converge to a point p eX,if 
for every e > 0, there exists an M e N such that ri(xn, p) <e for all n>M. The point p is said to 
be the limit of {jc„}. We write 

lim Xn := p. 

n— >oo 

A sequence that converges is said to be convergent. Otherwise, the sequence is said to be 

divergent. 

Let us prove that the limit is unique. Note that the proof is almost identical to the proof of the 
same fact for sequences of real numbers. Many results we know for sequences of real numbers can 
be proved in the more general settings of metric spaces. We must replace \x — y\ with d{x,y) in the 
proofs and apply the triangle inequaUty correctly. 

Proposition 7.3.3. A convergent sequence in a metric space has a unique limit. 

Proof. Suppose that the sequence {jc„} has the limit x and the limit y. Take an arbitrary e > 0. From 
the definition we find an Mi such that for all n>M\,d{xmx) < e/2. Similarly we find an M2 such 
that for all n > M2 we have d{xn,y) < ^ji. Now take an n such that n>Mi and also n > M2 

d{y,x) < d{y,x„) +d{xn,x) 
£ £ 

As d{y,x) < £ for all e > 0, then d{x,y) = and y = x. Hence the limit (if it exists) is unique. □ 
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The proofs of the following propositions are left as exercises. 
Proposition 7.3.4. A convergent sequence in a metric space is bounded. 

Proposition 7.3.5. A sequence {xn} in a metric space {X,d) converges to p & X if and only if there 
exists a sequence {a„} of real numbers such that 

d{x„,p) < a„ for all n&N, 

and 

lira an = 0. 

Proposition 7.3.6. Let {x„} be a sequence in a metric space (X, J). 

(i) If{xn} converges to p EX, then every subsequence {^m^.} converges to p. 

(ii) If for some K eN the K-tail {xn}n=K+i converges to p EX, then {xn} converges to p. 

7.3.2 Convergence in euclidean space 

It is useful to note what convergence means in the euclidean space M". 

Proposition 7.3.7. Let {x^}°J^^ be a sequence in W^, where we write x^ — {x\.,X2i ■ ■ ■ ,Xn) G K". 
Then {jJ-^jJ^j converges if and only ifixj^yj^i converges for every k, in which case 

lim X-' = ( lim x{ , lim xL--., lim xl ) . 

Proof. For M = the result is immediate. So let n > 1 . 

Let {xj}J^i be a convergent sequence in W\ where we write x^ = (jcjjXj, . . . G M". Let 
X — {xi,X2-, . . . , JCn) e M" be the limit. Given e > 0, there exists an M such that for all j >M we have 

d{x,x-^) < £. 

Fix some k= 1 , 2, . . . , n. For 7 > M we have 



i\^y{xk-4) ^d{x,xj)<e. (7.2) 



£=l 



Hence the sequence {x-jj^^i converges to x^. 

For the other direction suppose that {xj^}J^^ converges to x^ for every k = 1,2, ... Hence, 
given e > 0, pick an M, such that if j >M then |jc^ — jc^ | < e/^ for all A; = 1 , 2, . . . , n. Then 



d{x,x-')^ jY,{^k-xl) < 

k=l 



k=i « 



The sequence {x^} converges to x e M" and we are done. □ 
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7.3.3 Convergence and topology 

The topology, that is, the set of open sets of a space encodes which sequences converge. 

Proposition 7.3.8. Let {X,d) be a metric space and {xn} a sequence inX. Then {x„} converges 
to X eX if and only if for every open neighborhood U ofx, there exists an M eN such that for all 
n>M we have Xn G U. 

Proof. First suppose that {xn} converges. Let U be an open neighborhood of x, then there exists an 
£ > such that B{x^ e) C U . As the sequence converges, find an M G N such that for all n > M we 
have d{x,Xn) < £ or in other words Xn G B{x, e) C U . 

Let us prove the other direction. Given £ > let i7 := B{x, e) be the neighborhood of x. Then 
there is an M G N such that for n > M we have Xn EU = B{x, e) or in other words, d{x, jc„) < £. □ 

A set is closed when it contains the limits of its convergent sequences. 

Proposition 7.3.9. Let {X,d) be a metric space, E cX a closed set and {xn} a sequence in E that 
converges to some xEX. Then x EE. 

Proof. Let us prove the contrapositive. Suppose is a sequence in X that converges to x G E'^ . 
As E^' is open. Proposition 7.3.8 says there is an M such that for all n > M, Xn E So {x„} is not 
a sequence in □ 

When we take a closure of a set A, we really throw in precisely those points that are limits of 
sequences in A. 

Proposition 7.3.10. Let {X, d) be a metric space and A C X. Ifx E A, then there exists a sequence 
{xn} of elements in A such that lim Xn = x. 

Proof. Let xEA. We know by Proposition 7.2.20 that given i/n, there exists a point Xn E B{x, i/n) fl A. 
As d{x^Xn) < V"' we have that lim Xn=x. □ 

7.3.4 Exercises 

Exercise 7.3.1: Let {X,d) be a metric space and let A C X. Let E be the set of all x EX such that there 
exists a sequence {x,,} in A that converges to x. Show that E =A. 

Exercise 7.3.2; a) Show that d{x,y) := min{l, |x — j|} defines a metric on M. b) Show that a sequence 
converges in (M,(i) if and only if it converges in the standard metric, c) Find a bounded sequence in (M.,d) 
that contains no convergent subsequence. 

Exercise 7.3.3: Prove Proposition 7.3.4. 



Exercise 7.3.4: Prove Proposition 7.3.5. 
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Exercise 7.3.5; Suppose that {x„}"^ j converges to x. Suppose that f: N — )• N a one-to-one function. Show 
that {xfi^n)}n=\ converges to x. 

Exercise 7.3.6: If{X,d) is a metric space where d is the discrete metric. Suppose that {x„} is a convergent 
sequence in X. Show that there exists a K £N such that for all n>K we have x„ = xk. 

Exercise 7.3.7: A set S C X is said to be dense in X if for every xG X, there exists a sequence {x„} in S that 
converges to x. Prove that M" contains a countable dense subset. 

Exercise 7.3.8 (Tricky): Suppose {?/n},7=i be a decreasing (Un+i C U„for all n) sequence of open sets in a 
metric space {X,d) such that 0^=1 = {p} for some p £ X. Suppose that {x„} is a sequence of points in X 
such that Xn € Un. Does {x,,} necessarily converge to p? Prove or construct a counterexample. 

Exercise 7.3.9: Let E gX be closed and let {x„} be a sequence in X converging to p GX. Suppose Xn € E 
for infinitely many n S N. Show p £ E. 

Exercise 7.3.10: Take M* = {— oo}uMU{oo} be the extended reals. Define d{x,y) := \]^^ — \ 
x,y G M, define d{oo,x) := |l — j:^!, d{—°°,x) := |l + j^^^\forallx G M, and let d {00,-00^ := 2. a) Show 
that (R* ,d) is a metric space, b) Suppose that {x„} is a sequence of real numbers such that for every M G M, 
there exists an N such that x„ > Mfor all n>N. Show that lim x„ = 00 in (M* , J), c) Show that a sequence of 
real numbers converges to a real number in (M* , d) if and only if it converges in M with the standard metric. 

Exercise 7.3.11: Suppose that {V„}~^j is a collection of open sets in {X,d) such that V„+i D V„. Let {x,,} 
be a sequence such that x„ G V,j+i \ V„ and suppose that {x„} converges to p £X. Show that p G dV where 

v=[j:=iVn. 



Exercise 7.3.12: Prove Proposition 7.3.6. 
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7.4 Completeness and compactness 

Note: 2 lectures 

7.4.1 Cauchy sequences and completeness 

Just like with sequences of real numbers we define Cauchy sequences. 

Definition 7.4.1. Let {X,d) be a metric space. A sequence {x„} in X is a Cauchy sequence if for 
every e > there exists an M G N such that for all « > M and all > M we have 

d{Xn,Xk) < £. 

The definition is again simply a translation of the concept from the real numbers to metric 
spaces. So a sequence of real numbers is Cauchy in the sense of chapter 2 if and only if it is Cauchy 
in the sense above, provided we equip the real numbers with the standard metric d{x,y) = \x — y\. 

Proposition 7.4.2. A convergent sequence in a metric space is Cauchy. 

Proof. Suppose {xn} converges to x. Given e > there is an M such that for n > M we have 
d{x,Xn) < ^/2. Hence for all n,k > M we have d{xn,Xk) < d{xn,x) -\-d{x,Xk) < £/2 + e/2 = e. □ 

Definition 7.4.3. Let {X,d) be a metric space. We say thatX is complete or Cauchy-complete if 
every Cauchy sequence {x„} in X converges to an ;c G X. 

Proposition 7.4.4. The space M" with the standard metric is a complete metric space. 

Proof. For M = this was proved in chapter 2. 

Take n> \. Let {x^°°^^ be a Cauchy sequence in M", where we write x^ = {x\,X2, ■■■,xi) G M". 
As the sequence is Cauchy, given e > 0, there exists an M such that for all /, j > M we have 

d{x\x^) < £. 

Fix some k= 1 , 2, . . . , n, for /, j > M we have 

14 --^^1 = 7(4-4)' < J E (4 -4f = d{x\x^) < e. (7.3) 

V e=i 

Hence the sequence {4)7=1 Cauchy. As M is complete the sequence converges; there exists an 

jca: G M such that Xk = limj^cx,4- 

Write X = {xi,X2, ■ ■ ■ G W\ By Proposition 7.3.7 we have that {x-^} converges to x G M" and 
hence M" is complete. □ 
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7.4.2 Compactness 

Definition 7.4.5. Let (X^d) be a metric space and K cX. The set K is said to be compact if for 
any collection of open sets {C/aIag/ such that 

U^/a, 

there exists a finite subset {Ai , A2, . . . , A^t} C / such that 

J=l 

A collection of open sets {U^jx^i as above is said to be a open cover of ^. So a way to say that 
K is compact is to say that every open cover ofK has a finite subcover. 

Proposition 7.4.6. Let {X,d) be a metric space. A compact set K dX is closed and bounded. 

Proof. First, we prove that a compact set is bounded. Fix p eX. We have the open cover 

00 

KG [jB{p,n)=X. 

If K is compact, then there exists some set of indices n\ <n2< . . . <n]^ such that 

k 

Kd \jB{p,nj)^B{p,nk). 
7=1 

As K is contained in a ball, K is bounded. 

Next, we show a set that is not closed is not compact. Suppose that K ^ K, that is, there is a 
point xe K\K. Ify^x, then for n with i/n < d{x, y) we have y ^ C(x, . Furthermore x ^ ^, so 

00 

K(z[jCix,l/nY. 

n=l 

As a closed ball is closed, C(x, i/n)'^ is open, and so we have an open cover. If we take any finite 
collection of indices ni <n2< ... <nk, then 

k 

\JC{x,ynjY = C{x,l/n,Y 

As X is in the closure, we have C(x, i/n^t) r\K ^Q), so there is no finite subcover and K is not 
compact. □ 
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We prove below that in finite dimensional euclidean space every closed bounded set is compact. 
So closed bounded sets of M" are examples of compact sets. It is not true that in every metric space, 
closed and bounded is equivalent to compact. There are many metric spaces where closed and 
bounded is not enough to give compactness, see Exercise 7.4.8. However, see Exercise 7.4.12. 

A useful property of compact sets in a metric space is that every sequence has a convergent 
subsequence. Such sets are sometimes called sequentially compact. Let us prove that in the context 
of metric spaces, a set is compact if and only if it is sequentially compact. First we prove a lemma. 

Lemma 7.4.7 (Lebesgue covering lemma*). Let {X,d) be a metric space and K C X. Suppose that 
every sequence in K has a subsequence convergent in K. Given an open cover {Ux}x^i ofK, there 
exists a 5 > such that for every x E K, there exists a X E I with B{x, d) C Ux. 

Proof. Let us prove the lemma by contrapositive. If the conclusion is not true, then there is an open 
cover {Ux}xei of ^ ^i^h the following property. For every n eN there exists an .x:„ G ^ such that 
B{x„, i/n) is not a subset of any Ux ■ Given any x E K, there is a A G / such that xEUx- Hence there 
is an e > such that B{x, e) cUx - Take M such that i/m < e/2. If j G B{x, e/2) and n>M, then by 
triangle inequality 

B{y, l/n) C B{y, i/m) C B{x,£) C Ux- (7.4) 

In other words, for all n> M, Xn ^ B{x,^/2). Hence the sequence cannot have a subsequence 
converging io x. A^ x E K was arbitrary we are done. □ 

Theorem 7.4.8. Let {X,d) be a metric space. Then K G X is a compact set if and only if every 
sequence in K has a subsequence converging to a point in K. 

Proof. Let ^ C X be a set and {x„} a sequence in K. Suppose that for each x E K, there is a ball 
B{x, ttx) for some > such that Xn E B(x, ttx) for only finitely many n eN. Then 

KG \jB{x,ax). 

Any finite collection of these balls is going to contain only finitely many Xn- Thus for any finite 
collection of such balls there is an jc„ G ^ that is not in the union. Therefore, K is not compact. 

So if K is compact, then there exists anx E K such that for any 5 > 0, B{x, 5) contains Xk for 
infinitely many k eN. The ball B{x,l) contains some Xj^ so let ni := k. If is defined, then 
there must exist ak> nj^i such that x^ E B{x, 1/ ;), so define nj := k. Notice that d{x,Xnj) < V j. By 
Proposition 7.3.5, lim.x;„^. = x. 

For the other direction, suppose that every sequence in K has a subsequence converging in K. 
Take an open cover {Ux}xei of ^- Using the Lebesgue covering lemma above, we find a 5 > 
such that for every x, there is a A G / with B{x, 5) G Ux- 



*Named after the French mathematician Henri Leon Lebesgue (1875 - 1941). The number 5 is sometimes called 
the Lebesgue number of the cover. 
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Pick xi & K and find Ai e / such that B{xi , 5) C . If ^ C U^^ , we stop as we have found 
a finite subcover. Otherwise, there must be a point X2 E K\ Ux^ ■ Note that d{x2:X\) > 5. There 
must exist some A2 G / such that 5(^25 5) G Ux2- We work inductively. Suppose that is defined. 
Either Ux^ U ^7^2 U • ■ ■ U is a finite cover of ^, in which case we stop, or there must be a point 
XneK \ [Ux^ U t/;t2 U ■ ■ ■ U ) . Note that d{xn,Xj) > 5 for all j = 1 , 2, . . . , n - 1 . Next, there 
must be some Xn G / such that 5(jc„, 5) C Ux^. 

Either at some point we obtain a finite subcover of K or we obtain an infinite sequence {jc„} as 
above. For contradiction suppose that there is no finite subcover and we have the sequence {jc„}. 
For all n and k,n^k, we have d{x„,Xk) > 5, so no subsequence of {x„} can be Cauchy. Hence no 
subsequence of {x„} can be convergent, which is a contradiction. □ 

Example 7.4.9: The Bolzano-Weierstrass theorem for sequences of real numbers (Theorem 2.3.8) 
says that any bounded sequence in M has a convergent subsequence. Therefore any sequence in a 
closed interval [a,b] C M has a convergent subsequence. The limit must also be in [a,b] as limits 
preserve non-strict inequalities. Hence a closed bounded interval [a,b] C M is compact. 

Proposition 7.4.10. Let {X,d) be a metric space and let K dX be compact. Suppose that E C K 
is a closed set, then E is compact. 

Proof. Let {x,,} be a sequence m E. It is also a sequence in K. Therefore it has a convergent 
subsequence {xnj } that converges to some x E K. As, E h closed the limit of a sequence in E is also 
in E and %o x EE. Thus E must be compact. □ 

Theorem 7.4.11 (Heine-Bore^). A closed bounded subset K C M" is compact. 

Proof. For M = MMf ^ C M is closed and bounded, then any sequence {x^} in K is bounded, so it 
has a convergent subsequence by Bolzano-Weierstrass theorem (Theorem 2.3.8). As K is closed, 
the limit of the subsequence must be an element of K. So K is compact. 

Let us carry out the proof for n = 2 and leave arbitrary n as an exercise. As ^ C is bounded, 
there exists a set 5 = [a^b] x [c, J] C such that K C B. We will show that B is compact. Then K, 
being a closed subset of a compact B, is also compact. 

Let {{xk,yk)}'k=i be a sequence in B. That is, a < x^ < b and c <yk < d for all k. A bounded 
sequence of real numbers has a convergent subsequence so there is a subsequence {xk^}J^i that is 
convergent. The subsequence {ykj}J^i is also a bounded sequence so there exists a subsequence 
{yk . }T=i '^hat is convergent. A subsequence of a convergent sequence is still convergent, so {x^ . 



WKj.J,= l 

is convergent. Let 



X := limxyt -. 

I — ^00 -^i 



and y := limyj^. . 



By Proposition 7.3.7, { {xkj.,ykj. ) ] 



i=\ 



converges to (x, y) . Furthermore, a& a <Xk<b and c <yt<d 



for all k, we know that {x,y) G B. 



□ 



*Named after the German mathematician Heinrich Eduard Heine (1821-1881), and the French mathematician Felix 
Edouard Justin EmileBorel (1871-1956). ~ 
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7.4.3 Exercises 

Exercise 7.4.1: Let {X,d) be a metric space and A a finite subset ofX. Show that A is compact. 

Exercise 7.4.2; Let A = {^/n : n £ N} C M. a) Show that A is not compact directly using the definition, b) 
Show that AD {0} is compact directly using the definition. 

Exercise 7.4.3: Let {X,d) be a metric space with the discrete metric, a) Prove that X is complete, b) Prove 
thatX is compact if and only ifX is a finite set. 

Exercise 7.4.4; a) Show that the union of finitely many compact sets is a compact set. b) Find an example 
where the union of infinitely many compact sets is not compact. 

Exercise 7.4.5; Prove Theorem 7.4.11 for arbitrary dimension. Hint: The trick is to use the correct notation. 
Exercise 7.4.6: Show that a compact set K is a complete metric space. 

Exercise 7.4.7: Let C{[a,b],M.) be the metric space as in Example 7.1.7. Show that C{[a,b],M.) is a complete 
metric space. 

Exercise 7.4.8 (Challenging); LetC{\f), 1],M) be the metric space of Example 7.1.7. LetO denote the zero 
function. Then show that the closed ball C(0, 1) is not compact (even though it is closed and bounded). Hints: 
Construct a sequence of distinct continuous functions {/„} such that d{f„,0) = 1 and d{fn,fk) = ifor all 
n^k. Show that the set {/„ : n G N} C C(0, 1) is closed but not compact. See chapter 6 for inspiration. 

Exercise 7.4.9 (Challenging); Show that there exists a metric on M that makes R into a compact set. 

Exercise 7.4.10; Suppose that {X,d) is complete and suppose we have a countably infinite collection of 
nonempty compact sets E\ D £2 D £3 D • • • then prove P\j=\ 7^ 0- 

Exercise 7.4.11 (Challenging); Let C([0, 1],M) be the metric space of Example 7.1.7. Let K be the set 

of f G C([0, 1],M) such that f is equal to a quadratic polynomial, i.e. f{x) = a + bx + cx^, and such that 
\f{^)\ ^ I for all x£ [0,1], that is f £ C{0,1). Show that K is compact. 

Exercise 7.4.12 (Challenging); Let {X,d) be a complete metric space. Show that K C X is compact if 
and only if K is closed and such that for every e > there exists a finite set of points xi,. .. ,Xn with 
K C Uj=i £)■ Note: Such a set K is said to be totally bounded, so we can say that in a complete metric 
space a set is compact if and only if it is closed and totally bounded. 

Exercise 7.4.13; Take N C M using the standard metric. Find an open cover ofN such that the conclusion of 
the Lebesgue covering lemma does not hold. 

Exercise 7.4.14; Prove the general Bolzano-Weierstrass theorem: Any bounded sequence {xj^} in M" has a 
convergent subsequence. 
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7.5 Continuous functions 

Note: 1 lecture 



7.5.1 Continuity 

Definition 7.5.1. Let {X,dx) and {Y^dy) be metric spaces and c EX. Then / : X — )■ 7 is contin- 
uous at c if for every e > there is a 5 > such that whenever x EX and dx{x,c) < 5, then 
dY{fix)J{c))<e. 

When / : Z — )• y is continuous at all c EX, then we simply say that / is a continuous function. 

The definition agrees with the definition from chapter 3 when / is a real-valued function on the 
real line, if we take the standard metric on M. 

Proposition 7.5.2. Let {X,dx) and (7, dy) be metric spaces. Then f: X -^Y is continuous at c EX 
if and only if for every sequence {x„} in X converging to c, the sequence {f{xn)} converges to f{c). 

Proof. Suppose that / is continuous at c. Let {xn} be a sequence in X converging to c. Given e > 0, 
there is a 5 > such that d{x,c) < 5 implies d[f{x),f{c)) < £. So take M such that for all n>M, 
we have d{xn,c) < 5, then j(/(jc„), /(c)) < e. Hence {f {xn)} converges to /(c). 

On the other hand suppose that / is not continuous at c. Then there exists an e > 0, such that 
for every n E'H there exists an jc„ E X, with J(x„, c) < i/« such that d(yf{xn),f{c)) > £. Then {xn} 
converges to c, but {f{xn) } does not converge to /(c) . □ 

Example 7.5.3: Suppose /: — ;> M is a polynomial. That is, 

d d-j 

l^ajkX^y = aoo + aiox + aoiy + a2ox +auxy + ao2y H haorfj , 

j^Ok=0 

for some d eN (the degree) and aji^ E M. Then we claim that / is continuous. Let {{xn,yn)}n=i be 
a sequence in that converges to {x,y) E M?. We have proved that this means that limx„ = x and 
\imyn = y. So by Proposition 2.2.5 we have 

d d-j d d-j 

lim f{xn,yn) = lim Y YajkX^y^^ = E E ^j^^V = /(-«,j)- 

j=Ok^O j=Ok=0 

So / is continuous at {x,y), and as {x,y) was arbitrary / is continuous everywhere. Similarly, a 
polynomial in n variables is continuous. 
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7.5.2 Compactness and continuity 

Continuous maps do not map closed sets to closed sets. For example, /: (0, 1) — )• M defined by 
f{x) := X takes the set (0, 1), which is closed in (0, 1), to the set (0, 1), which is not closed in M. 
On the other hand continuous maps do preserve compact sets. 

Lemma 7.5.4. Let {X,dx) and {Y^dy) be metric spaces and f: X a continuous function. If 
K <zX is a compact set, then f{K) is a compact set. 

Proof. A sequence in f{K) can be written as {/(x„)}^^j, where {jc„}^^j is a sequence in K. The 
set K is compact and therefore there is a subsequence {xnj}Y=i that converges to some x e K.By 
continuity, 

\imfix„,)=f{x)ef{K). 

So every sequence in f{K) has a subsequence convergent to a point in f{K), and f{K) is compact 
by Theorem 7.4.8. □ 

As before, / : X — t- M achieves an absolute minimum at c G X if 

f{x)>f{c) foralljcGX. 

On the other hand, / achieves an absolute maximum at c G X if 

/W < /(c) for all x^X. 

Theorem 7.5.5. Let (X^d) be a compact metric space and f: X — )■ M a continuous function. Then 
f is bounded and in fact f achieves an absolute minimum and an absolute maximum on X. 

Proof. As X is compact and / is continuous, we have that f{X) C M is compact. Hence f{X) is 
closed and bounded. In particular, sup/(X) G f{X) and inf/(X) G f{X). That is because both the 
sup and inf can be achieved by sequences in f{X) and f{X) is closed. Therefore there is some 
X eX such that f{x) = sup/(X) and some y EX such that f{y) = inf/(X). □ 

7.5.3 Continuity and topology 

Let us see how to define continuity in terms of the topology, that is, the open sets. We have already 
seen that topology determines which sequences converge, and so it is no wonder that the topology 
also determines continuity of functions. 

Lemma 7.5.6. Let (X,dx) and {Y.dy) be metric spaces. A function f:X-^Yis continuous at 
c eX if and only if for every open neighborhood U of f{c) in Y, the set f^^{U) contains an open 
neighborhood ofc in X. 



232 



CHAPTER?. METRIC SPACES 



Proof. Suppose that / is continuous at c. Let U be an open neighborhood of /(c) in Y, then 
By (/(c), e) C U for some £ > 0. As / is continuous, then there exists a 5 > such that whenever 
X is such that dx{x,c) < 5, then dy (/(jc),/(c)) < £. In other words, 

Bx{c,5)cf-\By{f{c),e))cf-\u). 

and Bx{c, 5) is an open neighborhood of c. 

For the other direction, let £ > be given. If /^ ^ (5y (/(c) , £) ) contains an open neighborhood 
W of c, it contains a ball. That is, there is some 5 > such that 

Bx{c,d)cWcf-'{By{f{c),e)). 

That means precisely that if dx {x, c) < 5 then dy {f{x) , /(c)) < £ and so / is continuous at c. □ 

Theorem 7.5.7. Let {X,dx) and {Y,dy) be metric spaces. A function f : X ^ Y is continuous if 
and only if for every open U CY, f^^{U) is open in X. 

The proof follows from Lemma 7.5.6 and is left as an exercise. 

Example 7.5.8: Theorem 7.5.7 tells us that if E is closed, then f^^{E) = X \f^^{E^') is also 
closed. Therefore if we have a continuous function /: X — )■ M, then the zero set of /, that is, 
/^^ (0) = G X : f{x) = 0}, is closed. An entire field of mathematics, algebraic geometry, is the 
study of zero sets of polynomials. 

Similarly the set where / is nonnegative, that is, /^^ ([0,°°)) = {x eX : f{x) > 0} is closed. 
On the other hand the set where / is positive, /^^ ((0, oo)) = {x G X : f{x) > 0} is open. 

7.5.4 Exercises 

Exercise 7.5.1: Consider N C M with the standard metric. Let {X,d) be a metric space and f: X — t- N a 
continuous function, a) Prove that ifX is connected, then f is constant (the range of f is a single value), b) 
Find an example where X is disconnected and f is not constant 

Exercise 7.5.2; Letf:M.^^R be defined by /(0,0) := 0, and f{x,y) := if {x,y) (0,0). a) Show 
that for any fixed X, the function that takes y to f{x,y) is continuous. Similarly for any fixed y, the function 
that takes x to f{x,y) is continuous, b) Show that f is not continuous. 

Exercise 7.5.3: Suppose that f: X is continuous for metric spaces {X,dx) and (F,(iy). Let A <ZX. a) 
Show that f{A) C /(A), b) Show that the subset can be proper. 

Exercise 7.5.4; Prove Theorem 7.5.7. Hint: Use Lemma 7.5.6. 

Exercise 7.5.5: Suppose that f: X ^Y is continuous for metric spaces {X,dx) and (Yjdy). Show that ifX 
is connected, then f{X) is connected. 
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Exercise 7.5.6: Prove the following version of the intermediate value theorem. Let {X,d) be a connected 
metric space and f:X^Ma continuous function. Suppose that there exist xq,xi GX and y GM such that 
/(xo) < 3^ < f{xi ). Then prove that there exists a zGX such that f{z) = y. Hint: See Exercise 7.5.5. 

Exercise 7.5.7: A continuous function /: X — )• Y for metric spaces {X,dx) and (Y,dY) is said to be proper 
if for every compact set K CY, the set f^ ^ {K) is compact. Suppose that a continuous / : (0, 1 ) — )• (0, 1 ) w 
proper and {x„} is a sequence in (0, 1) that converges to 0. Show that {/(x„)} has no subsequence that 
converges in (0, 1). 

Exercise 7.5.8: Let (X,dx) and (Yjdy) be metric space and f: X ^Y be a one-to-one and onto continuous 
function. Suppose that X is compact. Prove that the inverse f^^ '■ Y ^ X is continuous. 

Exercise 7.5.9: Take the metric space of continuous functions C([0, 1],M). Let k: [0, 1] x [0, 1] ^ M be a 
continuous function. Given f € C([0, 1],M) define 



a) Show that T{f) := (pf defines a function T : C([0, 1],M) — t- C([0, b) Show that T is continuous. 

Exercise 7.5.10: Let {X,d) be a metric space, a) If p G X, show that f: X — t- M defined by f{x) := d{x,p) 
is continuous, b) Define a metric onX xX as in Exercise 7.1.6 part b, and show that g: X x X — )• R defined 
by g{x,y) '.= d{x,y) is continuous, c) Show that ifK\ and K2 are compact subsets ofX, then there exists a 
p G K[ and q £ K2 such that d{p,q) is minimal, that is, d{p,q) = inf{ {x,y) : x £ Ki,y £ K2}. 
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7.6 Fixed point theorem and Picard's theorem again 

Note: 1 lecture ( optional, does not require §6.3) 

In this section we prove the fixed point theorem for contraction mappings. As an application we 
prove Picard's theorem, which we proved without metric spaces in §6.3. The proof we present here 
is similar, but the proof goes a lot smoother with metric spaces and the fixed point theorem. 

Definition 7.6.1. Let {X,d) and {X',d') be metric spaces. /: X X' is said to be a contraction 
(or a contractive map) if it is a /r-Lipschitz map for some k < I, i.e. if there exists ak < I such that 

d'{f{x)J{y)) <kd{x,y) forallx,yGX. 

If/: X — 7- X is a map, x G X is called a fixed point if f{x) = x. 

Theorem 7.6.2 (Contraction mapping principle or Fixed point theorem). Let {X,d) be a nonempty 
complete metric space and f: X ^ X a contraction. Then f has a fixed point. 

The words complete and contraction are necessary. See Exercise 7.6.6. 
Proof. Pick any xq EX. Define a sequence by Xn+i := f{x„). 

d{Xn+uXn) =d[f{Xn)J{Xn-l)) < kd{Xn,Xn^l) < ■ < k"d{xi,XQ). 

Suppose m > n, then 

m—l 

d{Xm-,Xn) < ^ d{Xi^l,Xi) 
i=n 

m—l 

< k'd{xuxo) 

i=n 

= k"d{xi,xo) ^ k' 

OO -j^ 

< k'^d(xi,xo)Y,k' =k"d{xi,xo)- — -. 

,=0 ^ 

In particular the sequence is Cauchy (why?). Since X is complete we let jc := lim Xn and we claim 
that X is our unique fixed point. 

Fixed point? Note that / is continuous because it is a contraction (why?). Hence 

f{x) = lim f{xn) = limx^+i = x. 

Unique? Let j be a fixed point. 

d{x,y) =d{f{x)J{y)) =kd{x,y). 

As k < I this means that d{x,y) = and hence x = y. The theorem is proved. □ 
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Note that the proof is constructive. Not only do we know that a unique fixed point exists. We 
also know how to find it. Let us use the theorem to prove the classical Picard theorem on the 
existence and uniqueness of ordinary differential equations. 

Consider the equation 

Given some xo.yo we are looking for a function f{x) such that f'{xo) = yo and such that 

/ix)=F{xJ{x)). 

There are some subtle issues, for example how long does the solution exist. Look at the equation 
y = = 1- Then y{x) = is a solution. While F is a reasonably "nice" function and in 

particular exists for all x and y, the solution "blows up" atx= 1. For more examples related to 
Picard's theorem see §6.3. 

Theorem 7.6.3 (Picard's theorem on existence and uniqueness). Let I, J ClS.be compact intervals, 
let Iq and Jq be their interiors, and let (xq, Jo) ^ ^ x -^o- Suppose F : I x J -^M.is continuous and 
Lipschitz in the second variable, that is, there exists anLcM. such that 

\F{x,y) —F{x,z) \ <L\y — z\ for ally, zE J, xel. 

Then there exists an h> and a unique differentiable function f: [xq — h,xo + h] ^ J G M., such 
that 

f'{x) =F{xJ{x)) and f{xo)=yo. 

Proof. Without loss of generality assume xq = 0. As / x 7 is compact and F{x,y) is continuous, 
it is bounded. Let M := sup{\F{t,x)\ : {t,x) E I x J}. Pick a > such that [-«,«] C / and 
\yo-a,yo + a] cJ. Let 

• / « \ 

h := min < a, > . 

Note [-h,h] C /. Define the set 

Y := {f eC{[~h,hlR) : f{[-h,h]) c [.^o - a,xo + a]}. 

Here C( [—/?,/?], M) is equipped with the standard metric d{f,g) := sup{|/(x) — g{x) \ : x E [—h,h]}. 
With this metric we have shown in an exercise that C( [—/?,/?], M) is a complete metric space. 

Exercise 7.6.1: Show that Y C C([— /z,/z],M) is closed. 

Define a mapping T : 7 — )■ C{[—h, /i], M) by 

T{f){x) ■.= yQ + J^'F{t,f{t)) dt. 
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Exercise 7.6.2; Show that if f'. [—h,h] — ?■ J is continuous then F{t,f{t)^ is continuous on [—h,h\ as a 
function oft. Use this to show that T is well defined and that T{f) G C([— /z,/2],M). 



Let / G y and \x\ <h. As F is bounded by M we have 



T{f){x)-yo\= I^F{tJit))dt 




Therefore, T{Y) C Y. We can thus consider T as a mapping of Y to Y. 
We claim T is a contraction. First, for x G [—h,h] and G 7 we have 

\F{x, fix)) - F {x, gix)) I < L \fix) -g{x)\<Ld{f,g). 



Therefore, 




left hand side above to obtain J (r(/), r(^)) < jfftdd{f,g). 

We apply the fixed point theorem (Theorem 7.6.2) to find a unique / G 7 such that T{f) = f, 
that is. 



Exercise 7.6.3: We have shown that f is the unique function in Y. Why is it the unique continuous function 
f : [— /i, /i] — )• 7 that solves (7.5) above? 



7.6.1 Exercises 

Exercise 7.6.4: Suppose X = X' = M with the standard metric. Let 0<^<1, Z?gM. a) Show that the map 
F{x) = kx + b is a contraction, b) Find the fixed point and show directly that it is unique. 

Exercise 7.6.5: Suppose X =X' = [0, 1/4] with the standard metric, a) Show that f{x) = x^ is a contraction, 
and find the best (smallest) k that works, b) Find the fixed point and show directly that it is unique. 

Exercise 7.6.6: a) Find an example of a contraction of non-complete metric space with no fixed point, b) 
Find a 1-Lipschitz map of a complete metric space with no fixed point 

Exercise 1.6.1: Consider y' = y^, y{0) = 1. Start with fo{x) = 1. Find a few iterates (at least up to fj). 
Prove that the limit of fn is j^. 




□ 
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